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ABSTRACT 


This  thesis  provides  the  computer  software  to  perform  24 
of  the  aost  common  nonparametric  tests,  and  a  text 
explaining  how  and  why  to  use  that  software. 

Honparametric  tests  are  valuable  to  experimenters  and 
operations  analysts  for  three  reasons:  ease  of  explanation 
to  non- statisticians ,  simplicity  of  computation,  and  appli¬ 
cability  to  data  sets  which  cannot  be  analyzed  by  parametric 
tests.  Seasons  for  unsuitability  of  parametric  test  include 
data  of  nominal  or  ordinal  level  of  measurement,  lack  of 
common  variance,  or  lack  of  normal  distribution  of  the 
underlying  population. 

There  are  two  programs  provided:  Lochinvar,  an  interac¬ 
tive  Pascal  program,  and  Crunch,  a  Fortran  program. 
Lochinvar  is  designed  to  prompt  and  screen  a  user’s  entry  of 
data  and  options.  Crunch  performs  calculations;  it  is  suit¬ 
able  for  use  in  whole  or  part  as  a  subcomponent  of  other 
programs. 
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I.  IS  TR0DCTCTI3W 

This  thesis  is  intended  to  help  non-statiticians  use 
nonparaaetric  tests  to  thair  fullest.  The  text  explains 
when  nonparaastric  tests  are  preferable  to  parametric  tests, 
how  to  choose  the  most  appropriate  test,  and  how  to  use  the 
nonparaaetric  software  package. 

Parametric  tests  are  often  the  preferred  form  of  statis¬ 
tics  because  of  their  acceptance  in  the  academic  world,  the 
availability  of  computer  support,  and  flexibility  in  model 
building.  nonparaaetric  tests  provide  a  valuable  alterna¬ 
tive  to  paraaetrics,  because  they  may  be  applied  to  data 
which,  for  any  one  of  a  number  of  reasons,  cannot  be 
analyzed  with  parametric  tests.  The  simplicity  of  nonparame- 
tric  tests  often  sake  the  test  statistics  easier  to  compute 
and  test  results  easier  to  explain.  Section  II  gives  a 
detailed  comparison  of  parametric  versus  nonparametric 
tests. 

When  confronted  with  the  same  hypothesis,  various  para¬ 
metric  tests  give  consistent  results  because  each  test 
extracts  the  same  information  from  the  data  set. 
Nonparametric  tests  may  yield  varying  results  with  the  same 
data,  because  different  tests  extract  different  information 
from  the  data  set.  To  overcome  thLs  potential  problem,  the 
most  appropriate  nonparaaetric  test  should  be  chosen  to 
analyze  the  data  set.  section  IV  gives  guidance  on 
selecting  the  best  test. 

In  order  to  make  using  nonparaaetric  tests  as  easy  as 
possible,  two  programs  were  written  to  perform  the  most 
common  24  nonparametric  tests,  k  general  description  of  how 
to  use  the  programs  is  contained  in  section  III.  Sections  v 
through  XXVIII  contain  descriptions  of  each  of  the  tests. 


and  detailed  instructions  on  their  use.  The  appendices 
contain  aore  detailed  information  about  the  programs. 
Appendix  A  is  a  tutorial  which  depicts  use  of  the  programs 
to  perform  a  test  on  a  data  set.  Appendices  3  through  E 
describe  the  two  programs  and  two  files.  Appendix  s 
contains  the  interactive  Pascal  program,  Lochinvar,  while 
Appendix  F  contains  the  Portran  program  Crunch. 


II.  COMPARISON  OF  PARAMETRIC  VERSOS  NONP A£A MET RIC  TESTS 

This  section  contrasts  the  nonparametric  statistical 
test  with  parametric  statistics.  To  do  this,  there  is  a 
short  summary  addressing  various  levels  of  measurement  and 
a  brief  discussion  on  the  relationship  between  the  aal 
distribution  and  interval  data.  Finally  there  is  a  c  ar- 
ison  of  the  advantages  of  the  two  classes  of  tests.  '  ,„s 
will  help  the  reader  decide  whether  to  use  the  tests  cf  this 
text. 


A.  LEVELS  OF  MEASUREMENT 

Statisics  manipulate  raw  data  to  extract  whatever  infor¬ 
mation  that  .  lata  contains.  Level  of  measurement  indicate 
how  much  information  is  contained  in  the  data,  and  how  that 
information  may  be  manipulated.  Although  formally  there  are 
four  levels  of  measurement,  as  defined  by  Stevens  [Ref.  2], 
there  are  only  three  which  matter  when  choosing  the  test: 
nominal,  ordinal,  and  interval.  As  a  consistent  rule  any 
information  contained  in  data  of  a  lower  level  of  measure¬ 
ment  is  contained  in  data  of  a  higher  level;  any  manipula¬ 
tion  permitted  on  a  lower  level  is  permitted  on  a  higher 
level. 

Nominal  data  elements  contain  very  little  information: 
only  membership  in  a  subset  of  the  total  data  set  can  be 
determined.  Manipulation  of  the  nominal  data  sets  is 
limited  to  counting  the  number  of  elements  in  each  subset. 

Ordinal  data  elements  contain  information  relative  to 
other  elements:  element  A  is  either  less  than,  greater  than, 
or  equal  to  each  other  elament  in  the  set.  However,  the 
question  "greater  than  by  how  much"  cannot  be  answered.  The 
data  set  nay  be  ranked. 


Interval  data  elements  contain  a  or  a  information.  Not 
only  can  eleient  A  be  compared  to  other  elements,  bu~ 
inequalities  can  be  quantified;  e.g.,  A>B  implies  that  there 
exists  a  k  such  that  A=B+k,  and  that  the  k  can  be  found  by 
k=A- 3 .  The  data  set  can  be  manipulated  with  normal  arith¬ 
metic  operations,  such  as  addition,  division,  or  raising  to 
a  power. 


B.  LEVEL  OF  MEASUREMENT  AND  DISTRIBUTION  ASSUMPTIONS 

The  normal  distribution  is  completely  specified  by  two 
parameters:  expected  value  and  variance.  In  estimating 
those  parameters  form  a  data  set,  arithmetic  operations  are 
used,  and  intervals  are  treated  as  though  they  were  mean¬ 
ingful.  For  example,  the  expected  value  is  estimated  by  the 
sample  mean,  which  is  found  by  summing  the  data  elements  and 
dividing  by  the  number  of  elements.  The  variance  is  found 
by  summing  the  squares  of  the  intervals  between  each  data 
element  and  the  sample  mean,  and  dividing  by  the  number  of 
elements  minus  one.  Because  of  the  use  of  arithmetic  opera¬ 
tions  and  intervals  in  computing  its  parameters,  the  normal 
distributon  seems  to  require  interval  data  sets. 

Parametric  tests  assume  that  a  data  set  contains 
elements  with  an  interval  level  of  measurement  drawn  from  a 
normally  distributed  population.  Comparison  of  two  data 
sets  usually  require  the  assumption  of  equal  variance  in  the 
two  populations  from  which  the  samples  were  drawn.  There  is 
a  fair  amount  of  literature  concerning  how  valid  parametric 
tests  are  when  any  of  these  assumptions  are  violated;  a 
large,  vocal  group  of  statisticians  would  say  that  such 
tests  produce  valid  results  even  when  the  assumptions  are 
decidedly  not  met.  But  meeting  aLL  of  the  assumptions  of 
parametric  tests  are  the  guarantee  of  validity  when  usina 
such  test. 
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Nonparametric  tests  make  no  assumptions  about  the  under¬ 
lying  distribution  of  the  population  from  which  the  data  is 
drawn.  Additionally,  there  ara  nonparametric  tests  to 
analyze  data  with  nominal  level  of  measurement  or  higher. 
Because  some  nonparametric  tests  require  only  nominal  or 
ordinal  data,  and  don't  make  assumptions  about  population 
distributions,  the  power  of  those  tests  may  be  less  than 
that  of  parametric  tests —  assuming  that  the  data  set  justi¬ 
fied  use  of  parametrics  in  the  first  place. 

C.  ADVANTAGES  AND  DISADVANTAGES  OF  CLASSES  OF  TESTS 

The  advantages  and  disadvantages  of  parametric  tests  are 
the  mirror  image  of  those  of  nonparametric  test.  Therefore, 
the  advantages  of  each  class  of  tests  is  given. 

1 .  Advantages  of  Parametric  Tests 

The  general  advantages. of  parametric  tests  are  wide 
acceptance  of  methodology,  consistency  of  results,  and  capa¬ 
bility  to  handle  multipls  variables  in  systematic  model 
building. 

a.  wide  Acceptance 

The  wide  acceptance  of  parametric  test  has  a 
number  of  advantages.  Statistical  software  packages  contain 
a  very  complete  set  of  parametric  tests,  and  only  a  small — 
although  increasing  over  time — number  of  nonparametric 
tests.  Table  of  the  theoretical  distributions  needed  to 
judge  test  statistics  (the  normal,  student’s  t,  chi-square 
and  F)  are  extensive  and  ubiquitous.  The  terms  of  the  test 
are  known  to  a  majority  of  persons  trained  in  the  social  or 
physical  sciences;  whether  the  terms  are  fully  understood 
may  not  influence  the  value  of  their  recognition. 


b.  Consistency  of  Results 

Different  parametric  te.»ts  give  the  same  conclu¬ 
sions,  given  the  same  null  and  alternative  hypotheses. 
Parametric  tests  all  extract  the  same  information  from  the 
data,  and  merely  rearrange  it  for  different  purposes.  The 
parametric  distributions  are  all  derived  from  the  normal.' 
Bith  related  test  statistics  and  similar  test  distributions 
for  critical  values,  it  is  not  surprising  that  different 
techniques  give  consistent  answers  to  the  same  question. 

Non  para  me  tries  may  provide  different  results  to 
the  same  question  on  the  same  data  set,  because  different 
tests  extract  different  types  of  information  from  the  data 
set.  For  example,  with  paired  data  elements,  the  sign  test 
considers  only  which  member  of  the  pair  is  larger;  the 
signed  rank  test  needs  to  know  not  only  which  is  larger,  but 
also  the  magnitude  of  that  difference.  Consistency  of 
results  is  therefor  a  possible  advantage  for  parametric 
tests.  However,  this  advantage  can  be  minimized  by 
selecting  the  best  (most  powerful)  nonparametric  test 
possible,  as  discussed  in  a  later  section. 

c.  Versatility  in  Hodel  Building 

The  final,  and  probably  most  important,  advantage 
of  parametrics  is  their  versatility  in  explaning  very 
complex  interactions  in  data.  Analysis  of  variance  can  test 
differences  in  treatment  effects,  multiple  blocking  effects, 
and  interaction  among  treatments  and  blockings.  Regression 
can  be  made  to  do  analysis  of  variance,  (by  using  nothing 
but  dichotomous  variables)  ,  but  is  even  more  flexible,  by 
building  models  using  variables  with  continuous  values 
rather  than  only  a  fixed  number  of  treatment  levels. 


i.i$S 


knowledge  of  the  population  expected  value  is  corrected  for 
by  lowered  kurtosis;  the  chi-square  distribution  of  degrees 
of  freedom  n  is  the  sum  of  squares  of  n  standard  normal 
distributions;  the  P  is  the  ratio  of  two  chi-squares. 


This  versatility  of  parametrics  is  based  on  the 
very  strong  assueptions  made  about  the  populations  from 
which  the  data  is  drawn:  literature  beyond  the  elementary 

statistics  textbook  level  contains  many  ways  to  determine  if 
the  assumptions  are  met,  and  what  to  do  when  they  are  not 
met. 

Nonparametric  tests,  on  the  other  hand,  assume 
nothing  about  the  underlying  distribution,  and  often  work 
with  nominal  or  ordinal  data.  Not  surprisingly,  only  very 
straight-forward  answers  can  be  given  for  any  one  test.  A 
clever  application  of  a  sequence  of  tests  to  the  same  data 
set  may,  however,  yield  answers  to  a  number  of  different 
questions,  and  provide  some  inferences  on  interaction. 

2.  Aflia teases  HoaPirametric  rests 

The  advantages  of  nonparametric  tests  are  simplicity 
and  applicability. 

a.  Simplicity 

Nonparametric  tests*  simplicity  allows  for  rela¬ 
tively  easy  calculation  of  the  test  statistic  and  critical 
value  for  a  null  hypothesis.  This  simplicity  also  allows 
nonparametric  tests  to  be  explained  to,  and  defended  in 
front  of,  persons  who  are  not  trained  in  statistics. 

The  test  statistic  for  most  nonparametic  tests 
can  be  calculated  by  hand.  For  example,  a  set  of  20  pairs 
of  data  elements  (X,T)  can  be  compared  for  equality  of 
expected  value  (E(X)*E(T))  by  either  the  t-test  or  the 
nonparametric  sign  test.  The  t-tast  needs  sample  means  and 
variances,  so  some  arithmetic  operations  are  required:  four 
sums  of  20  terms  each,  40  squaring  of  numbers,  four  divi¬ 
sions.  In  contrast,  the  sign  test  requires  counting  in  how 
many  pairs  the  X  element  is  greater  than  the  T  element.  The 
contrast  is  not  always  so  one  sided;  as  the  amount  of  infor¬ 
mation  extracted  from  the  data  incraasas,  so  does  complexity 
of  computation.  For  example,  rank-order  nonparametric  tests 
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require  ranking  of  data  elements,  which  can  becoae  tedious 
in  large  data  sets.  However,  the  process  of  ranlcing  tends 
to  be  self-correcting,  because  an  uncanked  or  aisranked  data 
value  will  stand  out  in  a  simple  sequential  check  of  the 
ranks.  Arithmetic  mistakes  in  squaring,  summing,  et  cetera, 
may  not  be  so  obvious  unless  the  operation  is  redone.  When 
computations  are  made  by  hand,  therefore,  nonparaaetric  test 
statisistics  are  easier  to  find. 

Since  hand-held  calculators  are  even  more  ubiqui¬ 
tous  than  parametric  tables,  this  advantage  of  nonparaaetric 
test  is  less  important  than  it  once  was.  If  the  number  of 
keystroke  errors  is  proportional  to  the  number  of  numbers 
entered  in  the  calculator,  though,  the  simplicity  of  nonpar¬ 
aaetric  tests  may  still  be  an  advantage. 

Simplicity  could  also  be  an  advantage  in  the  rare 
case  when  tables  of  distribution  are  not  available.  Hany 
nonparaaetric  test  distributions,  especially  the  simpler 
tests,  rest  on  the  binomial  distribution.  For  small  sample 
sizes,  the  binomial  can  be  calculated  by  hand;  with  a  small 
calculator,  a  distribution  for  sample  size  of  20  or  30  can 
easily  be  calculated.  The  normal,  student's  t,  chi-square 
and  F  distributions  lack  closed  forms  for  their  cumulative 
distribution  functions.  Numerical  integration--even  if  the 
density  functions  were  remembered  or  available — would  be 
challenging  with  a  calculator  (since  all  but  the  normal 
require  one  or  more  gamma  functions!  and  probably  is  beyond 
the  realm  of  paper  and  pencil  mathematics. 

Simplicity  is  also  a  virtue  in  explaining  test 
results.  Comparison  and  rankings  are  fairly  easy  to  follow 
(although  sometimes  difficult  to  perform),  while  the  arith¬ 
metic  operations  of  parametric  tests  are  less  transparent. 
Host  non-statisticians  have  had  experience  in  tossing  coins, 
rolling  dice  or  some  games  of  chance.  Analogies  can  be  made 
from  those  experiences  to  the  evaluation  of  test  results. 
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In  the  previous  exaaple  of  20  pairs  (X,T)  ,  with  the  sign 
test  producing  a  test  statistic  of  4  (i.e.r  in  four  pairs, 
X>T)  ,  an  analogy  could  ba  Bade  to  the  odds  that  a  coin  were 
fair  if  it  caae  up  heads  four  tiaes  in  20.  Explaining  that 
saae  data  set  with  the  t-test  would  require  mention  of  stan¬ 
dard  deviations  and  pooled  variances,  and  perhaps  drawing  of 
overlapping  bell-shaped  curves.  An  audience  predisposed  to 
disagree  with  the  test  results  night  be  hard  to  convince 
with  parametric  tests. 

b.  Applicability 

Nonparanetric  tests  can  be  used  on  a  larger 
nuaber  of  data  set  than  can  paraaetic  tests,  because  of  the 
lack  of  assumptions  about  the  distribution  of  the  underlying 
population,  and  the  ability  to  handla  ordinal  or  interval 
data.  This  advantage  is  more  vital  in  the  "real  world"  than 
in  academia,  for  three  general  reasons: 

(1)  An  analyst  is  sometimes  handed  data  sets  to 
do  something  with;  for  any  one  of  a  number  of  reasons,  para¬ 
metric  tests  say  not  applicable.  Host  commonly,  the  form  in 
which  the  data  was  collected  in  may  fail  to  meet  the  under¬ 
lying  requirements  of  parasetrics,  but  meets  the  require¬ 
ments  of  nonparametr ics. 

(2)  The  cost  of  gathering  interval  data  is 
prohibitive,  while  ordinal  or  nominal  data  can  be  gathered 
at  low  cost.  For  example,  on  a  fiald  exercise,  uncalibrated 
eyes  can  generally  judge  distances  greater  than  or  less 
than,  but  some  sort  of  instrumentation  must  be  used  to 
determine  the  exact  magnitude  of  the  difference.  The  expen¬ 
sively  equipped  national  test  ranges  are  a  response  to  the 
need  for  very  accurate  interval  level  of  measurement  data. 
Laboratories,  too,  are  often  equipped  for  fine  metrics.  3ut 
in  field  or  operational  tests,  ordinal  level  of  measurement 
may  be  the  highest  level  available. 


(3)  The  expedient  was  designed  to  gather 
interval  data,  from  populations  with  common  variance,  but 
something  vent  wrong.  Instrumentation  sometimes  fails. 
Sven  if  the  measurement  is  accurate,  variances  may  not  be 
common  or  the  data  nay  not  be  transformable  to  look  normal. 
Bather  than  giving  up,  or  repeating  the  experiment,  nonpar- 
ametric  tests  may  answer  some  questions  originally  to  be 
answered  by  analysis  of  variance  or  regression. 


III.  IfiE  ;OM£OXgR  £A£K£ SI 


A.  GENERAL 

This  section  gives  a  brief  description  of  the  computer 
package.  Tests  are  described  in  sections  V  through  xxvili, 
while  the  Appendices  describe  the  programs  and  files  in 
detail. 

The  computer  package  consists  of  two  compater  programs, 
Lochinvar  and  Crunch.  Also  involved  are  three  types  of  CHS 
files:  data  files  and  an  options  file,  written  by  Lochinvar 
to  be  read  by  crunch;  and  output  files,  written  by  Crunch, 
containing  test  results. 

B.  LOCHINVAR 

The  program  Lochinvar  (LOad  CHoicas  and  INput  VARiables) 
is  the  user'h  main  contact  with  the  programs.  It  is  inter¬ 
active,  prompting  and  screening  the  user's  entries.  Its 
purpose  is  to  write  two  types  of  files  needed  by  the  program 
Crunch:  the  data  file,  and  the  option  file. 
i .  m n 

Lochinvar  allows  the  user  to  input  data  in  two  sepa¬ 
rate  ways.  The  user  must  decide  which  one  to  use  before 
starting  the  program. 

a.  Hethod  I:  Data  Values  into  a  Data  Pile 

Nhen  the  data  is  numerical,  the  user  should 
strongly  consider  this  method.  Lochinvar  prompts  entry  of 
data  set  and  variable  names,  and  of  the  data  set  values 
themselves.  All  that  information  is  sent  to  a  file  on  the 
user's  A  disk;  the  user  chooses  one  of  six  files,  "lochi 
a", . . ., "lochi  f,  into  which  the  data  is  written.  Unless 
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erased  or  over- written,  the  data  will  be  available  for 

repeated  ase. 

All  24  of  the  data  tests  in  this  text  can  compute 
test  statistics  fros  data  files.  However,  a  data  file 
itself  is  not  enough  to  perform  a  test;  the  option  file  is 
needed  to  reference  the  data  file  and  coaaunicate  the  choice 
of  test  and  test  parameters  to  Crunch.  After  a  data  file  is 
written,  Lochinvar  prompts  the  writing  of  the  options  file, 
as  described  in  subsection  2,  below. 

b.  Bethod  II:  Cell  counts  in  the  Options  Pile 

When  the  data  is  non  -numerical ,  this  is  the  user's 
only  option;  it  may  be  more  convenient  even  if  the  data  is 
numerical.  Pour  of  the  nonparametric  tests  handle  nominal 
data '•binomial,  HcHemar,  chi-square  test  for  independence, 
and  Cochran.  Instead  of  entering  all  the  data  values,  the 
user  nay  run  one  of  these  tests  by  entering  the  appropriate 
count  of  elements  which  fall  into  a  test's  subcategories. 
For  example,  the  binomial  test  can  be  run  on  a  large  data 
set  by  simply  entering  two  numbers:  the  count  of  "successes" 
and  the  count  of  "failures". 

In  this  method  of  data  entry,  Lochinvar  prompts 
entry  of  cell  counts  at  the  same  time  it  is  prompting  the 
test  selection  and  choice  of  parameter.  All  that  informa¬ 
tion  is  written  to  the  options  file;  no  data  file  is 
written. 

2.  11a  SEiiana  ZU2 

The  purpose  of  the  options  file  is  to  tell  Crunch 
everything  it  needs  to  know  to  perform  a  nonparametric  test. 
In  all  cases,  the  options  file  contains  the  user's  selection 
of  which  test  to  be  performed,  the  parameters  to  be  used  in 
that  test,  and  to  which  CHS  output  file  the  output  should  be 
written.  If  the  data  values  are  to  be  used  (Method  I)  ,  the 
option  file  states  which  data  file  Crunch  should  read.  If 
cell  counts  are  used  (Bethod  II)  ,  the  options  file  contains 
those  counts,  as  well  as  the  data  set  name. 
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The  options  file  is  described  in  detail,  with  an 
example,  in  Appendix  D. 

3 .  0§ina 

In  CHS,  type  "LOCHI".  The  program  will  display  a 
sequence  of  questions;  the  user's  response  will  depend  on 
the  method  of  data  enty  and  test  selection, 
a.  Use  of  Previously  Enter  Data 

The  program  asks  "do  you  wish  to  perform  a  test 
on  a  previously  entered  data  set?",  i.e.,  one  of  the  six  CHS 
data  files  "lochi  a"  ,...  ,"lochi  f". 

If  the  answer  is  yes,  then  Lochinvar  asks  which 
data  set  to  use.  The  program  reads  the  data  set's  name,  and 
displays  it  to  the  user,  and  asks  "Is  this  the  right  data 
set?"  If  yes,  the  user  indicates  so,  and  the  test  selection 
is  next.  If  no,  the  the  user  must  choose  another  file. 

If  the  answer  to  the  question  of  previously  enter 
data  is  no,  then  Lochinvar  asks  whether  data  entry  is  by 
data  values  (Hethod  I)  or  cell  counts  (Hethod  II)  .  If  the 
user  wants  to  enter  cell  county,  the  next  step  is  to  select 
a  test.  If  the  user  wants  to  enter  a  data  set,  the  program 
asks: 

(1)  To  which  file  the  input  data  is  to  be  written 

(2)  What  is  the  data  set  name,  number  of  vari¬ 
ables  and  variable  names. 

The  program  will  then  prompt  entry  of  data 
values.  If  there  are  two  variables  in  the  data  set,  the 
user  will  be  offered  the  choice  of  entering  data  pair  by 
pair;  this  will  only  work  if  there  are  an  equal  number  of 
elements  in  each  of  the  two  variables.  Normally,  all  the 
data  elements  of  the  first  variable  will  be  entered 
together,  then  all  of  the  second  variable,  et  cetera.  After 
entry  of  each  variable,  Lochinvar  will  offer  the  user  the 
chance  to  check,  change  or  add  data  elements  to  the  values 
just  entered. 
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After  the  data  set's  information  has  been  entered, 
Lochinvar  writes  it  to  the  specified  file.  Writing  of  the 
options  file  is  next. 

b.  Selection  of  the  Test  to  be  Performed 
After  the  choice  of  data  entry  has  been  given, 

Lochinvar  offers  a  menu  of  tests  from  which  the  user  may 
choose.  If  cell  counts  was  the  chosen  method  of  data  entry, 
four  tests  are  offered.  If  a  data  file  is  to  be  used,  the 
choice  set  of  test  offered  is  determined  by: 

(1)  the  number  of  variables  in  the  data  set 

(2)  whether  the  varibles  have  an  equal  number  of 

elements 

Lochinvar  will  not  offer  a  test  unless  the  data 
set  contains  at  least  the  minimum  number  of  variables  needed 
to  perform  the  test. 

Some  non  parametric  tests  require  that  data 
elements  of  one  variable  be  related  to  corresponding 
elements  cf  other  variables,  i.e.,  that  the  data  set  be 
paired  or  blocked.  Lochinvar  will  not  offer  such  a  test 
unless  there  are  the  same  number  of  elements  in  each  of  the 
data  set's  variables.  However,  variables  can  be  equal  sized 
but  not  paired  or  blocked.  The  user  should  not  choose  an 
inappropriate  test  just  because  Lochinvar  offers  it. 

c.  Output  Pile  Selection 

The  user  chooses  among  six  possible  output  files, 
"u  listing",.. .  "z  listing".  The  files  will  be  written  to 
user's  A  disk  after  execution  of  the  program  Crunch. 
Previous  contents  of  the  designated  file  will  be  overwritten 
after  Crunch  runs. 

d.  Test  Parameter  Entries 

Each  test  has  different  entries  required.  See  the 
section  on  the  desired  test,  sections  V  through  XXVIII  for 
specifics. 


V„V.1 

V  %v 
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C.  CRUNCH 


1  •  Executing  the  Non paraaetric  £e  st 

After  the  options  file  has  been  written,  the  user 
executes  the  test  by  typing  “CRUNCH”  while  in  CHS. 

2.  ikS  2I23I&1  £&&£& 

Crunch  reads  the  options  file,  and  when  necessary, 
the  designated  data  file.  It  performs  the  non para metric 
test  requested,  and  reminds  the  user  with  a  display  on  the 
screen  to  which  output  file  the  results  hare  been  written. 
Crunch  actually  writes  three  files: 

a.  The  Output  File 

The  output  files  are  named  '•  u  listing",  ...,  "z 
listing".  At  the  top  of  of  the  file  is  the  label, 
consisting  of  the  name  of  the  data  set,  the  number  of  vari¬ 
ables  in  the  data  set,  the  number  of  variables  used  in  the 
test  (if  less  than  the  number  in  the  whole  data  set)  ,  and 
the  names  of  the  variables  used  in  the  test.  Following  the 
label  is  the  name  of  the  test  performed.  The  rest  of  the 
output  file's  format  varies,  depending  on  the  test  chosen. 
In  general,  the  output  file  contain’s  only  the  most  salient 
results  of  the  computations.  All  output  files  contain 

either  a  test  statistic,  or  a  test  statistic  and  the  empir¬ 
ical  level  of  significance  for  the  desired  null  hypothesis. 

b.  The  Problem  Listing  File 

If  there  are  any  problems  encountered  by  the 
computer  that  result  in  Fortran  error  or  warning  messages, 
such  messages  will  be  written  to  the  CHS  file  "problem 
listing" . 

c.  The  Nptest  Listing  File 

This  file  contains  intermediate  computations  of 
the  program  as  it  works  towards  the  test  statistic.  For 
example,  the  Smirncv  test  requires  finding  differences  in 
probabilities  of  two  empirical  distributions.  Since  the 


test  statistic  is  simply  the  largest  absolute  difference, 
the  output  file  will  only  list  the  largest  positive  and  most 
negative  difference,  as  well  as  the  test  statistic.  "Nptesi 
listing1*,  asong  other  things,  will  include  each  difference 
between  the  eapirical  cuaulative  distribution  functions 
(cdf*s).  This  listing  file  is  not  arranged  to  be  attractive 
or  suitable  for  subaission,  but  it  is  labelled,  and  may  be 
of  soae  use  in  understanding  the  operations  of  Crunch. 


D.  ADDITIONAL  FEATURES  OF  THE  PACKAGE 

To  avoid  overwriting  input  or  output  files,  six  of  each 
were  provided.  If  more  are  needed,  the  user  can  rename  some 
of  the  existing  files. 

Lochinvar  is  a  convenient  way  to  enter  data  by  hand. 
Hhen  data  is  generated  froa  other  prograas  or  read  from  disk 
or  tape,  the  user  may  want  to  create  his  own  data  file 
rather  than  U3e  Lochinvar.  Detailed  instructions  for  this 
are  contained  in  Appendix  C. 

Crunch  is  written  in  Fortran.  Users  who  are  experienced 
prograaaers  aay  want  to  aodify  it  to  suit  their  own  tastes, 
or  adapt  portions  of  it  into  their  own  programs.  An  expla¬ 
nation  of  the  prograa  is  contained  in  Appendix  E.  The 
prograa  is  listed  in  Appendix  G. 


E.  CONSIDERATIONS  IN  DATA  ENTRY 

Lochinvar  is  written  in  Pascal,  a  convenient  language 
for  interactive  prograas,  but  a  language  which  may  be  unfa¬ 
miliar  to  aany  users.  This  section  provides  sufficient 
insight  to  Lochinvar  so  that  the  user  can  efficiently  commu¬ 
nicate  choices  to  the  program. 
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1.  Entry  o£  Numbers 

When  entering  numerical  data,  Lochinvar  requires 
that  the  first  character  read  is  either  a  numeral  or  a  minus 
sign  followed  by  a  numeral.  Therefore,  to  enter  a  number 
between  ainus  one  and  plus  one,  precede  the  decimal  with  a 
"0". 

When  entering  aore  than  one  number  at  a  time,  leave 
a  space  (not  a  comma)  between  the  numbers. 

If  Lochinvar  expects  a  number  and  gets  anything 
else,  it  will  halt  with  an  error.  This  is  a  convenient  way 
to  get  out  of  the  program  quickly,  but  anything  entered  on 
that  run  of  Lochinvar  will  be  lost.  The  one  exception  is 
when  entering  a  data  set  of  data  values  and  an  options  file 
in  a  single  run.  After  the  data  sat  is  entered,  the  program 
writes  it  to  the  designated  file,  and  asks  "now,  do  a  test 
with  this  data?"  Any  subsequent  errors  will  not  effect  the 
data  file  itself,  just  erase  all  additional  entries. 

2.  sntLi  2t  Ckaxas fcasa 

When  the  program  prompts  for  a  single  letter 
response,  e.g.  ,  "enter  I  or  N",  enter  "Y"  or  "N"  and  press 
the  enter  key.  The  program  expects  and  will  read  only  the 
first  letter.  Single  letter  entries  are  contained  in 
"repeat. . .until"  loops  which  will  not  allow  the  user  to 
procede  until  one  of  the  offered  choices  is  made. 

Do  not  use  the  alternate  character  (apl)  setting  on 
the  keyboard  while  using  Lochinvar. 

When  Lochinvar  proapts  for  the  data  set  name  or 
variable  names,  the  user  aust  enter  from  one  to  48  charac¬ 
ters.  These  names  are  one  of  the  few  things  for  which  the 
program  does  not  offer  a  chance  to  correct  an  error.  The 
user  aust  be  careful,  checking  the  names  prior  to  hitting 
the  "enter"  key.  If  an  error  is  made  anyway,  the  data  or 
options  file  can  be  corrected  using  Xedit  in  CMS. 
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3.  Minimum  Entry 

If  the  program  prompts  for  an  entry,  enter  at  least 
one  numeral  or  character.  The  interactive  read  commands  are 
in  "do  while  not  eoln  "  (and  of  line)  loops;  if  the  enter 
key  is  pressed  with  nothing  to  enter,  Lochinvar  will  halt 
with  an  error,  because  it  attempted  to  read  beyond  the  end 
of  the  line.  As  with  numerical  antry  errors,  previous 
entries  will  be  lost. 


IV.  22§£ER  Q.Z  AVAILABLE  NONPA  RAMETRIC  TESTS 


A.  CHITERI A  P08  SELECTIN3  THE  MOST  APPROPRIATE  TEST 

In  selecting  a  nonparametric  test,  the  user  must 
consider  three  criteria:  purpose  of  the  test,  data  set  level 
of  measurement,  and  whether  the  data  elements  are  related 
(i.e.,  paired  or  blocked). 

B.  CRITERION  I:  PURPOSE  OP  TEST 

Nonparametric  tests  can  be  successfully  used  to  test  for 
questions  of  location,  independence/interaction,  and  distri¬ 
bution.  Table  1  shows  a  matrix  of  the  nonparametric  test 
with  level  of  measurement  ( LOM)  oi  the  horizontal  axis  and 
general  purpose  on  the  vertical. 

Location  usually  refers  to  whether  the  expected  value  of 
one  population  is  equal  to  that  of  another,  based  on  sample 
variables.  Other  purposes  are  testing  for  proportion  of 
successes  or  for  value  of  a  single  population's  quantile. 
Table  2  lists  the  location  tests  with  their  null  hypotheses, 
to  help  the  user  narrow  the  choice  set. 

Independence  of  two  samples  refer  to  statistical  inde¬ 
pendence,  i.e.,  correlation  of  zero.  Interaction  may  be  a 
before/after  effect,  or  prediction  of  one  variable  by 
another.  Table  3  lists  these  tests  by  their  specific  null 
hypotheses. 

Some  tests  for  distribution  check  for  equal  variance 
amonq  variables.  Other  tests  check  a  single  variable's 
empirical  cumulative  distribution  function  (ecdf)  against  a 
theoretical  distribution,  with  either  hypothesized  or  esti¬ 
mated  parameters.  Still  other  tests  compare  the  empirical 


TABLE  1 

i 

Nonpara me trie  Tests 

PU££Q§e, 
Minimum  * 

Levels  of  Measurements: 

Interval  Ordinal 

Nominal 

LOCATION: 

1  var 

Quantile 

Binomial 

2  var 

wilcoxcn*  Sign* 

Mann-Whi tney 

>2  var 

Quade*  Median 

Friedman* 

Kruskal 

-Nallis 

Cochran 

INDEPENDENCE/ 

INTERACTION: 

1  var 

Cox-Stuart 

NcNemar 

2  var 

Correlation  Cox-Stuart* 

NP  Regression 

Monotone 

Regression 

R*C  Contingency 
(chi-square) * 

DISTRIBUTION  : 

1  var 

Kolmogorov  (N<31) 

Goodness  of  Fit 
(chi-square, N>  14) 
Wilkes-Shapiro  (N<31) 
Lilliefors  (N<31) 

2  var 

Smirnov 

Cramer-von  Mises  1 

1 

>2  var 

Squared  Rank 

Hartley 

i 

*  indicates  that  variables  must  be  pal 

red  or  blocked 

j 

cdf's  of  two  variables  against  each  other.  Table  4  has  a 
list  of  these  tests,  with  their  hypotheses,  levels  of  meas¬ 
urement,  and  other  considerations. 

The  decision  matrix  indentifies  tests  suited  to  each  of 
these  purposes.  The  specific  section  on  the  test  should  be 
checked  to  be  sure  that  the  test  will  do  what  is  desired. 
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TABLE  2 

Nonpar ametric  Tests  for  Location 


HYPOTHESIS 

NAME  # 

LOM 

SEC 

PAGE 

P  (success)  *P* 

Binonia  1 

1 

N 

V 

30 

P(X<X*)  =P* 

Quantil e 

1 

0 

VI 

35 

E(X)  *E(Y) 

Sign 

2* 

0 

IX 

47 

Mann-Whitney 

2 

0 

VII 

40 

Nile  oxo  n 

2* 

I 

XI 

65 

E  (0)  .*E  (Z) 

Cochran 

>2* 

N 

XXIII 

108 

Medi an 

>2 

0 

XVII 

83 

Kruskal-Wallis 

>2 

0 

XVIII 

88 

Friedna  n 

>2* 

0 

XXI 

99 

Quade 

>2* 

I 

XXII 

104 

*  indicates  that  variables  must  be  paired  or  blocked 


T1BLE  3 

Nonparaaetric  Tests  for  Inda pendance/lnteraction 


U M 


#  245S  i ON  SEC  PAGE 


Trend (v.  order)  Cox- Stuart 

1 

0 

VII 

43 

Trend  (X  v.  Y) 

Cox- Stuart 

2* 

0 

VII 

43 

Before/After 

HcNeaar 

2* 

N 

X 

51 

rho  (X,Y)  =0 

R*C  Contingency 
(chi-square  test 
for  mdepencence) 
NP  Correlation# 

2* 

N 

XI 

56 

2 

0 

XIV 

69 

E  (Y|  X)  »a+b*X 

NP  Regression 

>2* 

I 

XV 

74 

b*0 

Monotone  Regress 

>2 

I 

XVI 

79 

*  indicates  that  variables  oust  be  paired  or  blocked 

#  gives  pearson's  and  Spearman's  rho, Kendall's  Tau 
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TABLE  4 

Nonparametric  Tests  for  Location 

NAME  *  V4£S 

iOtl 

£££ 

PAGE 

var  (W)  = 

. . .*var(Z) 

Hartley  >2 

Squared  Rank  >2 

I 

I 

XX 

XIX 

96 

92 

Fx(X)  “GJixl 
(specified 
parameters) 

Kolmogorov  1N<31)  #  1 
Gcodness  of  Fit*  1 
(chisquare,N>14) 

I 

I 

XXIV 

XX 

113 

61 

Fx(X).*G*(X) 

(estimated 

parameters) 

Lillief ors  (N<31)  1 
(normal,  exponent  ial) 
Goodness  of  Fit#  1 
Hilkes- Shapiro  1 

(norma  1,N<3 1) 

I 

I 

I 

XXV 

XII 

XXVI 

118 

6  1 

121 

Fx  (X)  “GX  (X) 

F,G: 

empirical  cdfs 

Smirnov n  2 

Cramer- von  Mises  2 

0 

0 

XXVII 

XXVIII 

124 

127 

■ 

#  tests  normal 

, unif or m , exponent ial, we ibu 11, erlang 

C.  CBITEBION  II:  LEVELS  OF  MEASUREMENT 

Levels  cf  tleasareaent  were  treated  in  section  II. 

D.  CBITEBICN  III:  RELATIONSHIP  AMONG  DATA  ELEMENTS 

Data  set  variables  may  be  unrelated,  or  paired  (if  2 
variables)  or  blocked  (if  store  than  2  variables)  .  when 
unrelated,  the  order  of  variables  in  the  first  variable  is 
not  dependent  on  the  order  of  of  all  other  variables;  e.g. , 
the  data  elements  of  a  variable  could  be  entered  in  any 
order.  In  data  sets  with  unrelated  variables,  the  number  of 
elements  do  not  have  to  be  equal. 

Paired  or  blocked  data  sets  require  sample  sizes  to  be 
equal  and  the  order  of  the  elements  of  the  variables 
natters.  Examples  of  pairing  might  be  responses  of  husbands 
and  wives,  the  intelligence  of  twins,  the  condition  of  the 
same  experimental  subject  before  and  after  the  experimental 
treatment. 


E.  SELECTING  THE  BEST  TEST 


ks  aentioaed  in  section  II,  two  different  nonparametric 
tests  applied  to  the  saae  data  sat  could  conceivably  give 
different  results  to  the  saae  hypothesis.  The  example  was 
the  sign  test  and  the  iilcoxon  signal  rank  test.  The  reason 
given  in  section  II--that  those  two  tests  extract  different 
inforaaticn  froa  the  data--underscores  the  reason  for  the 
decision  aatrix.  In  general,  the  aost  appropriate  nonparam¬ 
etric  test  is  one  that  tasts  exactly  what  is  desired,  and 
uses  all  the  available  information  contained  in  the  data. 
Using  all  the  available  information  implies  selecting  a  test 
which  requires  the  level  of  measurement  of  the  data,  and 
using  a  paired  or  blocked  test  if  appropriate. 

These  rules  require  thought.  For  example,  the  user 
aight  want  to  know  if  procedure  X  produces  better  results 
than  procedure  I.  kt  first  glance,  that  is  a  problem  of 
location:  is  E(X)*E(Y)?  However,  it  could  also  be  consid¬ 
ered  a  prcblea  of  distribution:  is  Fx(z)»Fy(z)?  The  test 
for  distribution  would  check  the  aapirical  distributions 
across  the  entire  range  of  observations,  and  thus  use  more 
of  the  information  of  the  data  set,  particularly  important 
with  small  data  sets. 

The  computer  package,  described  in  the  previous  section, 
makes  running  multiple  tests  on  the  same  data  set  fairly 
easy.  The  user  can  explore  the  results  of  using  different 
tests  on  the  saae  data,  to  get  insight  into  his  problem. 


r# 


7.  BINOMIAL  TgST 


A.  OVERVIEW 

1.  SaiEfiss 

To  characterize  p,  a  population’s  proportion  of 
"sac cesses” ,  based  on  the  sample. 

2.  Levej,  o£  Measurement 
Nominal 

3.  A§saasll2as 

The  sample  is  randomly  drawn  from  the  population. 
Data  aay  be  divided  into  any  dicotoious  groupings,  i.  e. , 
two  autually  exclusive  and  collectively  exhaustive  groups. 
Here,  the  groups  are  called  "successes"  and  "failures", 
u .  Hypotheses 

Let  p*  be  a  specified  number  0<p*<=1. 

a.  Two  tailed  test 

(1)  Null  hypothesis  h3:  p*p* 

(2)  Alternative  hypothesis  his  p#p* 

b.  One  tailed  test 

(1)  Null  hypothesis  hD:  pSp* 

(2)  Alternative  hypothesis  his  p>p* 

c.  One  tailed  test 

(1)  Null  hypothesis  hD:  p>p* 

(2)  Alternative  hypothesis  hi:  ?<p* 

B.  ACTION  REQUIRED  ET  USER  OF  PROGRAM 

1.  U&SI  Decisions  aboijt  Procrai  Options 

The  user  specifies  which  treatment  of  the  data  set 
is  used  for  the  test,  a  hypothesized  proportion  of  successes 
(p*)  ,  and  the  desired  level  of  significance  (alpha*)  .  The 


program  uses  the  count  of  successes  and  failures;  there  are 
three  nays  to  indicate  that  dichotomization: 

a.  The  user  enters  cell  counts.  This  is  the  only  way 
to  handle  non* numerical  data,  but  will  work  for  any  data. 

b.  The  user  dichotomizes  the  data  set  by  a  user 
specified  list.  This  is  the  way  to  handle  numerical, 
nominal  data.  Data  values  which  are  elements  of  the  sped* 
fied  list  are  in  one  set;  the  rest  of  the  data  values  are  in 
the  other  set.  The  user  indicates  whether  the  list  defines 
success  or  failure. 

c.  The  data  set  is  divided  in  two  by  a  partition 
value.  This  is  only  possible  if  the  data  is  numerical  and 
at  least  ordinal.  All  elements  above  the  partition  value 
are  in  one  set;  the  user  defines  whather  that  set  is  success 
or  failure.  The  values  below  the  partition  are  the  other 
type. 

2.  Program  Activation  with  Cell  Counts  (choice  a  above) 

a.  Initiation  and  test  selection 

In  CMS,  type  LOCHI.  Enter  ,,N,‘  for  no  to  question 
about  previously  entered  data.  Enter  "CM  for  cell  counts. 
Enter  "D","?",..  ."Z"  for  desired  output  file,  when  offered  a 
choice  of  tests,  enter  1. 

b.  Cell  count  entry 

Lochinvar  will  ask  the  number  of  successes  and 
number  of  failures-* enter  those  cell  counts. 

c.  Parameter  entry 

Lochinvar  will  ask  for  a  hypothesized  proportion 
of  successes  P*,  and  a  desired  level  of  significance, 
alpha*.  When  prompted,  enter  those  numbers 

3.  ilfiam  AS&jkSftti.gB.  wilk  fiali  Zllllia,  (choice  b  or  £) 

a.  Initial  Choices 

See  section  III.C  for  a  detailed  explanation  of 
the  following. 


In  CSS,  type  LOCHI.  If  data  is  already  in  a  data 
file,  enter  "Y"  to  first  question,  then  enter  file  name; 
when  offered  a  test,  type  1.  If  data  needs  to  be  entered, 
enter  "N"  for  first  question,  "V"  for  second;  when  data 
entered,  choose  to  do  a  test;  when  offered  a  choice  of 
tests,  type  1.  Enter  "  U",  "7",. . .  "ZH  for  desired  output 
file. 

b.  Variable  and  Parameter  Choice 

fihen  prompted  by  Lochinvar: 

(1)  Enter  index  of  variable  to  be  used  in  test. 
Enter  whether  dichotomizing  by  list  or  by  partition  value 

(2)  If  by  list,  enter  the  number  of  elements  in 
that  list.  Enter  elements  of  list.  Enter  "Y",  when 
prompted  if  elements  on  that  list  represent  success. 

(3)  If  by  partition  value,  enter  that  value. 
Enter  NYN  if  data  elements  below  that  value  are  successes. 

4.  isil  Slftga&lSS 

To  perform  the  test  ,  after  the  above  entry  of  data 
and  parameters,  type  CRUNCH  while  in  CHS. 

5.  output  Available  to  tfrq  User 

The  program  CRUNCH  displays  the  data  file  to  which 
the  data  has  been  sent,  e.g.,  "U  LISTING".  The  names  of  the 
data  set  and  the  variable  used  in  ths  test  are  at  the  top  of 
the  output  file.  The  empirical  level  of  significance, 
alpha-hat,  and  confidence  interval  limits,  are  listed. 

C.  ACTIONS  PERFORMED  BY  THE  PROGRAH  CRUNCH 

1  •  Description  of  Calculation  of  Ley  ei  of  Significance 

iflisuais 

a.  Count  number  of  successes.  Divide  by  the  total 
number  of  elements  to  give  p-hat. 

b.  If  NS2Q,  generate  a  binomial  probability  mass 
function  with  parameters  N  and  p-hat.  Otherwise,  use  the 


with  parameters 


normal  approximation  to  the  binomial, 

N*p-hat  and  N*p-hat*  (1  -  p-hat). 

c.  Compute  tail  probabilities 

(1)  The  upper  tail  probability  is  the  likelihood 
that  the  number  of  successes  is  greater  than  or  equal  to  the 
sample  number  of  successes. 

(2)  The  lower  tail  probabilty  is  the  likelihood 
that  the  number  of  successes  is  lass  than  or  equal  to  the 
sample  number  of  successes. 

d.  compute  the  level  of  signficance,  alpha-hat 

(1)  if  hO:  p<p*,  alpha-hat  equals  the  upper  tail 

probability. 

(2)  if  hO:  p£p*,  alpha-hat  equals  the  lower  tail 

probabiltiy 

(3)  if  hO:  p*p  *,  alpha-hat  equals  two  times  the 
minimum  of  the  upper  and  lover  tail  probabilities 

e.  compute  the  interval,  B  less  than  21 

(1)  Let  yl  ■  number  of  successes  minus  one,  - 

index  of  the  pmf  of  the  lower  tail  probabilty.  Vary  p*, 
0Sp*<«p-hat,  until  the  lower  tail  probability  equals 

alpha*/2.  P*  is  the  lower  confidence  limit. 

(2)  Let  y2  *  number  of  successes  plus  one, 

■index  of  the  pmf  of  the  upper  tail  probability.  Vary  p*, 
p-hatSp*<»1,  until  the  upper  tail  probability  equals 

alpha*/2.  This  p*  is  the  upper  confidence  limit. 

f.  Compute  the  interval,  B  greater  than  20 

(1)  Compute  the  standard  error  of  estimate  by 
taking  the  square  root  of  the  quantity  (number  of  successes 
minus  the  number  of  failures,  divided  by  B  to  the  third). 

(2)  Find  the  value  z,  which  is  the  desired  quan¬ 
tile,  1-alpha*/2,  of  the  standard  normal  distribution 

(3)  The  lower  confidence  limit  is  p-hat  minus  z 
times  the  standard  error.  The  upper  confidence  limit  is 
p-hat  plus  z  times  the  standard  error. 
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2.  £££mi  2E£I3ii2S5 

a.  Deter  nine  the  cell  count 

b.  If  N  is  less  than  or  equal  to  20 

(1)  Generate  a  vector  of  N  choose  k,  k=0,1,...,N. 

(2)  Generate  a  binomial  pmf,  with  parameters  N 
and  p-hat*number  of  successes  divided  by  N. 

(3)  Compute  the  tail  probabilities 

(4)  Compute  the  empirical  level  of  significance 

(5)  Using  the  previously  generated  N  choose  k 
vector,  generate  partial  binomial  pmf's,  with  terms  0  to 
number  of  successes  minus  one,  or  number  of  successes  plus 
one  to  H.  &  bisection  search  technique  varies  p,  0<p<=p-hat 
or  p-hat<»p<*1,  until  the  sum  of  the  terms  of  the  partial 
pmf  is  within  0.0001  of  the  desired  level  of  significance 
(alpha*/2,  because  the  interval  is  two  tailed). 

c.  If  the  sample  contains  more  than  twenty  elements 

Ose  the  normal  approximation  to  the  binomial. 
The  subroutine  Muvar  uses  numerical  integration  to  find  tail 
probabilities;  from  which  the  empirical  level  of  signifi¬ 
cance  is  estimated.  Buvar  also  funishes  the  z  quantile  for 
computing  the  confidence  interval  limits. 

3.  £!23afl 

a.  Limitations 

Maximum  number  of  data  points:  400,  if  the 

program  computes  the  cell  count  from  a  data  list;  unlimited 
if  the  user  enters  the  cell  counts  directly.  Maximum  number 
of  variables:  1. 

b.  Subroutines/Procedures  Used 

(1)  Lochinvar:  Binome,Which_vec,Hyp 

(2)  Crunch:  Bin  test;  Muvar,  Cibi21  or  Nchuzk, 
Binpmf,  Cibi20,  Bisex 

(3)  External:  Shsort 
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VI.  30  AHTI1E  TEST 


A.  OVERVIEW 

1.  general  Pa£22S® 

Th«  quantile  test  addresses  the  question  of  what 
proportion  of  the  population  is  less  than  or  equal  to  a 
specified  value.  An  example  would  be  that  a  student  might 
be  concerned  less  with  the  average  grade  on  a  test  than  the 
proportion  of  students  who  received  at  least  the  minimum 
passing  grade. 

2.  Level  2t  qea  sure  neat 
Ordinal 

3.  Assaagilaa 

The  data  set  is  a  randoa  sample  from  the  population. 

4 .  Hypotheses 

Here,  p*  is  the  specified  population  proportion, 
0Sp*<*1,  and  x*  is  a  specified  value  within  the  range  of  the 

population. 

a.  Two  tailed  test 

(1)  Bull  hypothesis  h3: 
the  p*th  quantile  *  x* 

(2)  Alternative  hypothesis  hi: 
the  p*th  quantile  #  x* 

b.  One  tailed  test 

(1)  Bull  hypothesis  hD: 
the  p*th  quantile  >  x* 

(2)  Alternative  hypothesis  hi: 
the  p*th  quantile  <  x* 

c.  One  tailed  test 

(1)  Hull  hypothesis  h3: 

the  p*t h  quantile  <  x* 


(2)  Alternative  hypothesis  hi: 
the  p*th  quantile  >  x* 

B.  ACTION  REQUIRED  BY  USER  OF  PROGRAM 

1 .  User  Decisions  abou  t  Program  Options 

The  user  must  specify  the  values  for  p*  and  x*,  and 
the  letter  of  the  hypothesis  desired. 

If  there  is  more  than  one  variable  in  the  designated 
data  set,  the  user  oust  specify  which  is  to  be  used. 

2.  Prograa  Activation  when  JJsing  Data  Values 

a.  Initial  Choices 

See  section  III.C  for  a  detailed  explanation  of 
the  following. 

In  CMS,  type  LOCHI.  If  data  is  already  in  a  data 
file,  enter  "T"  to  first  guestion,  then  enter  file  name; 
when  offered  a  test,  type  2.  If  data  needs  to  be  entered, 
enter  "N"  for  first  question,  "V"  for  second;  when  data 
entered,  choose  to  do  a  test;  when  offered  a  choice  of 
tests,  type  2.  Enter  " U", "V", . . . "Z"  for  desired  output 
file. 

b.  Variable  and  Parameter  Choice 
tfhen  prompted  by  Lochinvar: 

(1)  Enter  index  of  variables  to  be  used  in  test, 
if  there  is  more  than  one  in  the  data  set. 

(2)  Enter  the  values  of  p*,  x*,  the  letter  of 

the  chosen  hypothesis,  and  the  hypothesized  level  of  signif¬ 
icance,  alpha*. 

3-  2££t 

To  perforn  the  test,  after  the  above  entry  of  data 
and  parameters,  type  CRUNCH  while  in  CMS. 

4.  Output  Available  tp  the  User 

The  prograa  CRUNCH  displays  the  data  file  to  which 
the  data  has  been  sent.  e.g..  "U  LISTING".  The  names  of  the 


data  set  and  variables  used  are  at  the  top  of  the  output 
file.  Also  included  are  the  test  statistic  (the  number  of 
data  elements  less  than  or  equal  to  x*)  ,  the  level  of 
significance,  and  confidence  limits  for  the  population's 
p*th  quantile. 


C.  ACTIONS  PERFORMS 0  BY  THE  PROGRAM  CRUNCH 

1 .  Calculation  of  Lev  el  of  Significance  and  Confidence 

a.  Compute  the  Test  Statistic 

Crunch  compare  the  data  elements  with  x*, 
counting  the  number  which  are  less  than  or  equal  to  x*.  If 
there  are  no  ties,  T1  *  T2  =  the  count.  If  some  elements 
are  equal  to  x*,  then  T1  *  the  count  of  data  elements  less 
than  x*,  and  T2  *  T1  plus  the  number  of  ties. 

b.  Computation  of  the  Level  of  Significance 

If  T1  »  T2,  the  computation  of  the  level  of 

significance  is  exactly  the  same  as  the  binomial  test,  with 
T1  as  the  numbe.  of  successes,  the  total  number  of  data 
elements  as  n,  and  p*  as  the  p*  of  the  binomial  test.  If 
T1  <  T2,  then  the  lower  tail  probability  is  calculated  using 
T1;  the  upper  tail  probability  uses  T2;  and  the  computation 
procedes  as  with  a  normal  binomial  test.  See  section  V  for 
details  about  computation  of  level  of  significance  in  the 
binomial  test. 

c.  Find  the  Confidence  Limits 

The  data  elements  are  ranked;  the  ranked  values 
are  called  "order  statistics",  so  that  X  ( 1 )  is  the  smallest 
value,  X  (k)  is  the  largest  values  in  a  data  set  of  k  values. 
The  confidence  limits  are  found  by  computing  the  values  r 
and  s,  which  are  indices  of  the  order  statistics;  once  they 
are  found,  X(r)  is  the  lower  limit,  X  (s)  is  the  larger 


(1)  With  less  than  21  data  elements,  a  binomial 
pmf  is  generated,  with  parameters  1c  =  number  of  data 
elements,  and  p  =  p*.  R  is  found  by  summing  terms  of  the 
paf  from  the  Oth  term  (P(N=0))  until  the  first  term  which 
causes  the  sum  to  be  greater  than  the  half  of  the  hypothezed 
level  of  significance  (alpha*/2)  .  Ihe  index  of  that  term  is 
r.  S  is  found  by  summing  terms  of  the  pmf  from  the  kth  term 
(P(N*k))  until  the  first  term  which  causes  the  sum  to  be 
greater  than  alpha*/2.  The  index  of  that  term  is  S. 

This  procedure  is  conservative  on  both  bounds,  and 
therefore  may  be  very  conservative  overall.  For  example, 
with  k  =  18,  p*  *  0.5,  and  alpha*=  0.15,  Crunch  would  choose 

r  of  5  and  s  of  13;  the  probability  that  in  the  population 

an  x  would  fall  between  X(5)  and  X(13)  is  0.904,  while  the 

user  would  have  been  satisfied  with  about  0.85.  With  the 

binomial  tables,  the  user  might  choose  indices  6  and  13,  or 
5  and  13,  which  would  have  only  0.0481  in  one  tail  but 
0.1151  in  the  other,  so  that  the  test  would  have  almost 
exactly  the  alpha  desired. 

(2)  With  more  than  20  data  elements  (=k)  ,  the 
hypothesized  variance  is  k(p*(1-p*));  its  square  root  is  the 
sample  standard  deviation.  The  expected  value  is  kp*.  R  is 
the  expected  value  plus  the  quantity  standard  deviation 
times  the  (alpha*/2)  quantile  of  the  standard  normal  distri¬ 
bution.  If  r  is  not  an  integer,  round  up.  s  is  the 
expected  value  plus  the  quantity  standard  devation  times  the 
(1-alpha*/2)  quantile  of  the  standard  normal  distribution. 
Again,  round  up  if  necessary  to  make  s  an  integer. 

2.  Program  Het hod  of  Operation 

The  subroutine  celler  calculates  T1  and  T2  by 
calling  the  subroutine  dicot,  which  takes  the  values  of  the 
data  set  and  find  the  number  less  than  and  the  number  less 
than  or  equal  to  the  specified  x*.  The  subroutine  quantile 
receives  that  information,  then  calls  the  subroutine  bintst 
to  perform  the  binomial  test  described  above. 
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Quantl  then  calls  subroutines  Nchuzk  and  Bmpmf  to 
generate  a  binomial  pmf  to  calculate  the  confidence  limit 
indices  r  and  s,  if  the  total  number  of  values  is  20  or 
less.  Otherwise,  Quantl  calls  the  subroutine  Normv  for 
values  of  the  (alpha*/2)  and  (1-alpha*/2)  quantile  of  the 
normal  distribution  to  use  in  the  calculation  of  r  and  s. 
The  data  values  are  sorted  by  the  subroutine  RNQ,  and  the 
X(r)  and  X(s)  values  dereferenced  as  the  confidence  limits. 

3 •  Program  Characteristics 

a.  Limitations 

Maximum  number  of  data  points:  400.  Only  one 
variable  is  used. 

b.  Subroutines/Procedures  Used 

(1)  Lochinvar:  Quantile 

(2)  Crunch:  Quantl,  Cellar,  Dicot,  Bintst;  Rnq, 
Nchuzk,  Binpmf  or  Normv 

(3)  External:  Shsort 


VII.  MANN  WHITNEY  TEgT 


A.  OVERVIEW 

1 .  General  Purpose 

With  two  variables,  test  whether  the  populations 
from  which  the  variables  ware  drawn  have  the  same  mean. 

2.  Level  o£  Measurement 
Ordinal 

3 .  Assumptions 

a.  Each  variable  is  a  random  sample. 

b.  There  is  mutual  independence  between  samples. 

4 .  Hypotheses 

a.  Two  tailed  test 

(1)  Null  hypothesis  hO:  E(X)=E(Y) 

(2)  Alternative  hypothesis  hi:  E(X)#E(Y) 

b.  One  tailed  test 

(1)  Null  hypothesis  h3:  B(X)>E(Y) 

(2)  Alternative  hypothesis  hi:  E(X)<E(Y) 

c.  One  tailed  test 

(1)  Null  hypothesis  hO:  E(X)<E(Y) 

(2)  Alternative  hypothesis  hi:  E(X)>E(Y) 


B.  ACTION  REQUIRED  EY  USER  OF  PROGRAM 

1 .  Oser  Decisio ns  about  Program  Options 

The  user  must  decide  which  two  of  the  variables  to 
use  in  the  te3t  if  the  data  set  contains  more  than  two.  The 
user  specifies  the  level  of  significance. 

2.  Program  Act ivatiqn  when  Using  Data  Values 
a.  Initial  Choices 

See  section  III.C  for  initial  choice  details. 
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In  CMS,  type  LOCHI.  If  data  is  already  in  a  data 
file,  enter  MTM  to  first  question,  then  enter  file  name; 
when  offered  a  test,  type  4.  if  data  needs  to  be  entered, 
enter  WN"  for  first  question,  '*7"  for  second;  when  data 
entered,  choose  to  do  a  test;  whan  offered  a  choice  of 
tests,  type  4.  Enter  "U,,,,,vM,..  •  "Z"  for  desired  output 
file. 

b.  Variable  and  Parameter  Choice 
When  prompted  by  Lochinvar: 

(1)  Enter  indices  of  variables  to  be  used  in 

test. 

(2)  Enter  the  level  of  significance. 

3.  Test  Execution 

Tc  perform  the  test,  after  the  above  entry  of  data 
and  parameters,  type  CBUNCH  while  in  CMS. 

4.  output  Available  to  the  User 

.-v.  The  program  CBUNCH  displays  tha  output  file  to  which 

V\\\ 

the  data  has  been  sent,  e.g.,  ”0  LISTING".  The  names  of  the 
data  set  and  variables  used  are  at  the  top  of  the  output 
file.  The  empirical  level  of  significance  and  whether  to 
reject  the  null  hypothesis  are  stated. 


C.  ACTIONS  PEBFOBMED  BT  THE  PBOGRAM  CRUNCH 

1 •  Description  of  calculation  of  Test  Statistic/Level 
of  Significance 

a.  Sd«s  of  the  Ranks  of  tha  First  Variable 

Bank  the  values  of  the  two  variables.  Sum  the 
ranks  of  each.  The  test  statistic  I  is  the  sum  ^f  the  ranks 
of  the  first  variable. 

b.  Overall  Test  Statistic 

The  overall  test  statistic,  if  their  are  many 
ties,  is  T1.  Let  n  be  the  number  of  elements  in  the  first 
variable,  m  the  number  of  elements  in  the  second  variable;  N 
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is  the  sub  of  n  and  m.  Sub  the  squared  ranks  of  both  vari¬ 
ables  to  find  K2.  The  numerator  of  the  test  statistic  is  T 
minus  n(N+1)/2.  The  denominator  is  the  square  root  of  the 
difference  of  two  terms:  R2*nm/N (N- 1 ) ,  and 

(na  (H+1)  **2)  /4  (N-1)  . 

2.  Program  Set  hod  of  0  Deration 

The  subroutine  Manwht  calls  the  nonimsl  subroutine 
Utest,  to  get  the  level  of  significance. 

3 •  Program  Characteristic^ 

a.  Limitations 

Maximum  number  of  data  points:  400.  Maximum 
number  of  variables:  2. 

b.  Subroutines/Procedures  Used 

(1)  Lochinvar :  Manwhit 

(2)  crunch:  Manwht 

(3)  External:  Dtest 


VIII.  COX-STO ART  TESJ  FOR  JR END 

A.  OVERVIEW 

1 .  General  Purpose 

This  is  a  specialized  fori  of  the  sign  test,  to 
determine  if  the  data  eleaents  ia  the  second  half  of  a 
sample  are  significantly  greater  or  less  than  ^.he  corre¬ 
sponding  values  in  the  first  half.  With  one  variable,  the 
order  of  the  data  elements  determines  how  the  data  elements 
are  paired  for  comparison. 

With  two  variables,  in  which  the  data  elements  are 
paired,  the  ranks  of  the  independent  variable  orders  the 
values  of  the  dependent.  This  yields  a  rough  test  of  the 
interaction  or  dependence  of  the  two  variables. 

2.  IsiSl  it  a&isvff.smeat 

Ordinal 

3 .  l5SM£tioas 

a.  The  data  elements  represent  random  draws 

b.  Either  there  is  no  trend,  or  there  is  an  under¬ 
lying  trend  upward  or  downward  throughout  data. 

4 .  2l£Q  theses 

Where  (♦)  indicates  the  second  data  element  is 
greater  than  the  first  in  the  pair,  otherwise  (-)  ,  so  that 
PC*)*P(-)  indicates  no  trend,  P(*)>P(-)  indicates  upward 
trend  (1  variable)  or  vary  directly  (1st  variable  to  2d 
variable),  and  P  (v)<P(-)  indicates  downward  trend  or  vary  an 
inverse  relationship. 

a.  Two  tailed  test 

(1)  Null  hypothesis  hO:  P(  +  )=P(-) 

(2)  Alternative  hypothesis  hi:  ?(+)*P(-) 

b.  One  tailed  test 

(1)  Null  hypothesis  hO:  P(+)<P(-) 


(2)  Alternative  hypothesis  hi:  P(+)>P(-) 

c.  One  tailed  test 

(1)  Mull  hypothesis  hO:  P(  +  )>P(-) 

(2)  Alternative  hypothesis  hi:  P(«-)<P(-) 

B.  ACTICH  REQUIRED  EY  USER  OF  PROGRAM 

1 .  User  Decisio ns  a boat  Program  Options 

If  one  variable  is  used,  make  sure  data  elements  are 
in  the  order  (often  order  of  time)  for  the  trend  to  be 
tested.  If  two  variables  are  to  be  used,  data  elements  must 
be  paired.  The  user  selects  the  type  of  hypothesis  and  the 
level  of  significance.  If  there  are  two  variables,  the  user 
specifies  which  is  dependant. 

2.  Program  Activation  when  U§inq  Data  Values 

a.  Initial  Choices 

See  section  III.C  for  a  detailed  explanation  of 
the  following. 

In  CHS,  type  LOCHI.  If  data  is  already  in  a  data 
file,  enter  "I"  to  first  question,  then  enter  file  name; 
when  offered  a  test,  type  3.  If  data  needs  to  be  entered, 
enter  "N"  for  first  question,  "V"  for  second;  when  data 
entered,  choose  to  do  a  test;  whan  offered  a  choice  of 
tests,  type  3.  Enter  "  Uw,  ,,y,,r . . .  "Z”  for  desired  output 
file. 

b.  Variable  and  Parameter  Choice 

When  prompted  by  Lochinvar: 

(1)  Enter  index  or  indices  of  variables  to  be 
used  in  test.  Enter  pairs  to  ba  excluded,  if  using  two 
variables. 

(2)  Enter  letter  of  hypothesis  type  and  level  of 
significance. 

(2)  If  there  are  two  variables,  enter  which  is 

dependent. 
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4 •  Test  Sxecut ion 

To  perform  the  test,  after  the  above  entry  of  data 
and  parameters,  type  CRUNCH  while  in  CHS. 

5.  Output  Available  to  the  User 

The  program  CRUNCH  displays  the  output  file  to  which 
the  data  has  been  sent,  e.g.,  "U  LISTING".  The  names  of  the 
data  set  and  variables  used  are  at  the  top  of  the  output 
file. 

C.  ACTIONS  PERFORMED  BY  THE  PROGRAM  CRUNCH 

1.  Description  of  Calculation  of  Level  of  Significance 

a.  Fora  Pairs  (X,Y)  Significance 

(1)  If  there  is  one  variable,  call  it  I,  and  go 
to  step  (2).  If  two  variables,  let  X  be  the  independent 
variable,  and  Y  be  dependent.  Find  the  order  of  the  X.  Use 
that  ordering  to  re-order  Y  in  ascending  order  of  the  values 
of  X. 

(2)  Let  k  be  the  number  of  elements  in  the  vari¬ 
able  Y.  If  K  is  even  then  there  will  be  (k/2)  pairs,  if  odd 
then  ((k-1)/2)  pairs.  Let  np  equal  the  number  of  pairs. 

b.  Find  Test  Statistic 

(1)  For  i  =  1  to  np,  compare  Y(i)  to  Y(np-i*1).  If 
Y(i)  >  Y(np-i*1)  then  increment  the  counter  for  failures.  If 
Y(i)  <  Y(np-i  +  1)  then  increment  the  counter  for  successes. 
The  test  statistic  is  from  the  binomial  test,  with  n  equal 
the  number  of  successes  plus  failures,  p*  equal  0.5,  and  the 
number  of  success  equal  number  of  times  the  second  half  data 
elements  are  strictly  greater  than  the  corresponding  first 
half  elements. 

2.  Pro  era  m  Method  of  0  peration 

a.  Ordering  the  Dependent  Variable 

Tha  subroutine  Coxstu  dereferences  the  X  values 
into  the  INDEP  vector,  sorts  those  values  using  Shsort,  and 


uses  the  indices  returned  xn  the  vector  KEY  to  dereference 
the  Y  values  into  a  vector  TLIST . 

If  there  is  only  one  variable,  use  a  do  for  loop 
to  load  its  values,  in  their  current  order,  into  TLIST. 

b.  computing  the  Test  Statistic 

The  subroutine  Coxstu  calls  the  subroutine  Signt 
to  perfora  the  sign  test.  See  section  IX. 

3 .  Pro oraa  £ka£§ctgristiss 

a.  Limitations 

Maximum  number  of  data  points:  490.  Maximum 
number  of  variables:  2. 

b.  Subroutines/Procedures  Used 

(1)  Lochinvar:  Coxstu 

(2)  Crunch:  Coxstu,  Signt 

(3)  External:  Shsort 


IX.  SIGN  TEST 


A.  OVERVIEW 

1 .  ssaml  £2££0£i 

with  pairs  of  data  (Xi,Yi) ,  determine  if  the 
expected  value  of  the  first  element,  E(X),  is  greater  than 
the  expected  value  of  the  second  elea  ant,  E (I) . 

2.  level  o£ 

Ordinal  within  aach  pair;  that  is,  it  can  be  deter¬ 
mined  that  Xi  is  greater  than  Yi  (+)  ,  or  that  Yi  is  greater 
than  Xi  (-)  ,  or  that  they  are  equal.  The  values  of  the  pair 
do  not  have  to  be  ordered  with  any  values  outside  the  pair. 

3 .  Assumptions 

a.  Random  Sample 

X,Y  are  a  random  sample  of  the  population  pairs. 

b.  Level  of  Heasureaent 

Ordinal  within  each  pair,  nominal  among  pairs. 

c.  Invariance  of  Probability  that  Xi  >  Yi 

The  underlying  population  from  which  the  sample 
is  drawn  is  assumed  to  have  a  constant  P  (X>Y)  . 
flIE2£k§Si§ 

Here,  (♦)  is  the  case  that  X>Y,  (-)  is  the  case  that 
T>X,  where  X,I  refer  to  population  random  variables. 

a.  Two  tailed  test 

(1)  Hull  hypothesis  hO:  P(+)=P(-) 

(2)  Alternative  hypothesis  hi:  P  (♦)  #P  (-) 

b.  One  tailed  test 

(1)  Null  hypothesis  hO:  P(+)<P(-) 

(2)  Alternative  hypothesis  hi:  ?(+)>P(-) 

c.  One  tailed  test 

(1)  Null  hypothesis  hO:  P(*)>P(-) 

(2)  Alternative  hypothesis  hi:  P  (*)<P(~) 


B.  ACTION  REQUIRED  BY  USER  OF  PROGRA H 


1 .  User  Decisions  about  Program  Options 

It  is  possible  that  the  data  is  non-numer ical ,  but 
ordinal  within  pairs,  e.g.,  a  list  of  winners  of  arm¬ 
wrestling  contest  between  two  platoons  as  a  measure  of  arm 
strength.  If  so,  then  perform  the  test  as  a  binomial  test, 
entering  cell  counts,  with  number  of  successes  as  the  number 
of  times  Xi  >  Yi,  n  is  the  total  number  of  untied  pairs,  and 
p*  as  0.5. 

Bith  numerical  data,  the  program  will  make  the 
comparisons.  The  user  must  enter  the  hypothesis  letter  and 
the  hypothesized  level  of  significance. 

If  there  are  more  than  two  variables  in  the  desig¬ 
nated  data  set,  the  user  must  choose  which  are  included  in 
the  test. 

2.  Program  Activation  when  Usiaq  Data  Values 

a.  Initial  Choices 

See  section  III.C  for  a  detailed  explanation  of 
the  following. 

In  CHS,  type  LOCHI.  If  data  is  already  in  a  data 
file,  enter  MTM  to  first  question,  then  enter  file-  name; 
when  offered  a  test,  type  5.  If  data  needs  to  be  entered, 
enter  "N"  for  first  question,  "V"  for  second;  when  data 
entered,  choose  to  do  a  test;  when  offered  a  choice  of 
tests,  type  5.  Enter  " UM, "V",. . . "Zn  for  desired  output 
file. 

b.  Variable  and  Parameter  Choice 

When  prompted  by  Lochinvar: 

(1)  Enter  indices  of  variables  to  be  used  in 
test.  Enter  blocks  or  pairs  to  be  excluded,  if  any. 

(2)  Enter  the  letter  of  the  hypothesis  type  and 
the  hypothesized  level  of  significance,  alpha*. 


•  3.  ye§t  Execution 

Tc  perform  the  test,  after  the  above  entry  of  data 
and  parameters,  type  CRUNCH  while  in  CMS. 

4.  Output  Available  to  the  Us§£ 

The  program  CRUNCH  displays  the  output  file  to  which 
the  data  has  been  sent,  e.g.,  "U  LISIING".  The  names  of  the 
data  set  and  variables  used  are  at  the  top  of  the  output 
file.  The  test  statistic  and  the  level  of  significance  are 
printed,  as  well  as  whether  to  reject  the  null  hypothesis. 

C.  ACTIONS  PERFORMED  BY  THE  PROGRAM  CRUNCH 

1 .  Description  of  Calc ulation  o£  ^evel  of  significan ce 

a.  Calculation  of  the  Test  Statistic 

For  each  pair  (Xi,Yi),  subtract  Yi  from  xi. 
Count  the  number  of  times  the  difference  is  greater  than  0: 
this  is  the  test  statistic.  The  total  number  of  untied 
pairs  is  the  count  of  times  the  difference  is  not  zero. 

b.  Calculation  of  the  Level  of  Significance 

The  level  of  significance  is  calculated  by  the 
binomial  test,  with  the  number  of  successes  equal  to  the 
test  statistic,  the  total  sample  size  equal  to  the  number  of 
untied  pairs,  and  the  hypothesized  proportion  of  success  p* 
equal  to  0.5. 

See  section  V  for  details  about  computation  of 
levels  of  significance  by  the  binomial  test. 

2.  Eiggaa  aat&sd  of  station 

The  subroutine  Signt  compares  the  values  of  the 
lover  indexed  variable  with  the  corresponding  values  of  the 
other  variable.  Values  within  epsilon  (=0.0001  by  default) 
are  considered  tied.  If  not  tied,  then  the  program  checks 
whether  Xi>Yi  or  Xi<Yi,  and  increments  the  appropriate 
count,  (♦)  or  (-)  .  Signt  then,  calls  the  subroutine  Bintst 
to  compute  the  level  of  significance. 
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X.  MCNE  MAR  TEST  FOR  SIGNIFICANCE  OF  CHANGE 

A.  OVERVIEW 

1 .  General  Purpose 

With  data  in  pairs  (Xi,Yi),  and  x  and  y  values  each 
classifiable  as  successes  or  failures,  to  determine  if  a 
significant  change  has  occurred. 

2.  Level  Wea  sure  meg  t 
Nominal 

3.  Assuaptj  oqs 

a.  The  pairs  are  a  random  sample. 

b.  X  and  I  can  be  divided  into  mutually  exclusive 
and  collectively  exhaustive  groups:  success  and  failure. 

c.  Each  of  the  k  pairs  can  be  categorized  intc  one 
of  four  cells:  success/success  (1,1),  success/failure  (1,0), 
failure/success  (0,1),  and  failure/failure  (0,0). 

4.  3X£°£k23i£ 

a.  Null  hypothesis 

HO:  P  (0, 1)»P(1,0)  for  all  i 

b.  Alternative  hypothesis 

HI:  P  (0,  1)  #P  ( 1 , 0 )  for  all  i 

B.  ACTION  REQUIRED  ET  0SER  OF  PROGRAM 

1.  Saar  Pecisigp?  abotjt  Program  Options 

If  the  data  is  nominal,  the  user  must  enter  the 
counts  of  each  of  the  four  categories. 

If  the  data  is  ordinal  or  above,  the  user  may  enter 
cell  counts,  or  may  specify  partition  values  to  dichotomize 
X  and  T.  Lochinvar  will  ask  for  the  partition  value  for  X, 
and  whether  successes  are  less  than  it.  For  example,  if 


17.5  is  the  partition  value  for  X,  and  successes  are  greater 
than  it,  any  X  value  greater  than  17.5  will  be  a  success; 
depending  on  its  Y  value,  the  pair  will  be  counted  in  the 
success/success  (1,1)  cell,  or  success/failure  (1,0)  cell. 
After  asking  for  the  X  partition  value,  Lochinvar  will  ask 
for  the  Y  partition  value  and  whether  successes  lie  above  or 
below. 

If  there  are  more  than  two  variables  in  the  desig¬ 
nated  data  set,  Lochinvar  will  ask  which  two  are  to  be  used 
for  the  test. 

2.  Progra m  Activation  when  Using  Cell  Counts 

a.  Initiation  and  test  selection 

In  CMS,  type  LOCHI.  Enter  "N"  for  no  to  question 
about  previously  entered  data.  Enter  "C"  for  cell  counts. 
Enter  "O" ,”7”, . .  ."Zn  for  desired  output  file,  when  offered  a 
choice  of  tests,  enter  6. 

b.  Cell  count  entry 

Lochinvar  will  ask  the  cell  counts  for  each  of 


the  four  categories;  success/success,  success/failure, 
failure/success,  and  failure/failure. 

Enter  the  hypothesized  level  of  significance. 

3 .  Program  Activation  when  Using  Data  Values 
a.  Initial  Choices 

See  section  III.C  for  a  detailed  explanation  of 
the  following. 

In  CMS,  type  LOCHI.  If  data  is  already  in  a  data 
file,  enter  MY"  to  first  question,  then  enter  file  name; 
when  offered  a  test,  type  6.  If  data  needs  to  be  entered, 
enter  MNW  for  first  question,  MV"  for  second;  when  data 
entered,  choose  to  do  a  test:  when  offered  a  choice  of 


b.  Variable  and  Parameter  Choice 

When  prompted  by  Lochinvar: 

(1)  Enter  indices  of  the  two  variables  to  be 
used  in  test.  Enter  blocks  or  pairs  to  be  excluded,  if  any. 

(2)  Enter  the  partition  value  for  the  lower 
indexed  variable  and  whether  successes  are  lass  than  that 
value.  Enter  the  partition  value  for  the  variable  with  the 
higher  index;  indicate  whether  successes  are  less  than  that 
value. 

(3)  Enter  the  hypothesized  level  of  significance. 

“•  les*  SiesaSlon 

To  perform  the  test,  after  the  above  entry  of  data 
and  parameters,  type  CRUNCH  while  in  CHS. 

5.  Out  nut  Available  to  t&e  User 

The  program  CRUNCH  displays  the  output  file  to  which 
the  data  has  been  sent,  e.g.,  "U  LISTING''.  The  names  of  the 
data  set  and  variables  used  are  at  the  top  of  the  output 
file. 

If  there  are  20  or  less  pairs  in  the  data  set  which 
are  classified  as  (1,0)  or  (0,1),  then  the  test  statistic  is 
T2*number  of  pairs  classified  as  (1,0);  that  test  statistic 
and  level  of  siginif icance  will  be  listed.  Otherwise,  the 
test  statistic  T1,  explained  below,  and  the  level  of  signif¬ 
icance  will  be  given. 

In  either  case,  the  program  will  compare  the  empir¬ 
ical  level  of  significance  with  the  hypothesised,  and  state 
whether  the  null  hypothesis  is  rejected. 


C.  ACTIONS  PERFORHED  BY  THE  PROGRAH  CRUNCH 

1 .  Descript  ion  of  Calc  ulation  of  Level  of  Significance 
a.  Computation  of  cell  Counts 

If  cell  counts  were  provided  by  the  user,  go  to 
step  b  or  c.  If  data  values  were  used.  Crunch  examines  each 


pair  (Xi,Yi)  to  see  the  relation  each  data  element  has  to 
its  corresponding  partition  value.  Crunch  decides  which 
cell  the  pair  belongs  to,  ( 1 , 1)  , . . .  (0  ,3)  ,  and  increments  the 
count  for  that  cell. 

b.  If  Count  for  Cells  (1,0)  and  (0,1)  less  than  21 

The  test  statistic  is  T2,  'hich  equals  the  count 
of  cell  (1,0).  The  binomial  test  is  invoked  to  find  the 
level  of  significance.  T2  is  the  number  of  successes;  the 
total  count  of  cells  (1,0)  and  (0,1)  as  n,  the  total  sample 
size;  the  hypothesized  proportion  of  success,  p*  is  0.5. 
Sea  section  V  for  details  of  computation. 

c.  If  Count  for  Cells  (1,0)  and  (0,1)  exceeds  20 

The  test  statistic  is  T1 ,  equal  to  the  squared 
difference  between  the  cell  counts  (1,0)  and  (0,1),  divided 
by  their  sum.  This  test  statistic  is  compared  to  the  chi- 
square  distribution  with  one  degree  of  freedom  to  find  the 
level  of  significance. 

2.  Program  Method  of  0  peration 

a.  Cell  Count  Calculation 

The  subroutine  Cellar  compares  each  of  the  lower 
indexed  values  to  the  first  partition  value,  then  each  of 
the  other  variable's  values  to  the  second  partition  value. 
kt  each  comparison,  a  "1”  is  assigned  to  an  integer  vector 
if  the  value  is  a  success,  otherwise  a  "0"  is  assigned.  The 
resulting  integer  vector,  with  the  same  number  of  elements 
as  the  original  data  set,  is  then  used  to  find  cell  counts. 
With  k  pairs,  the  first  through  the  kth  elements  of  the 
integer  vector  are  multiplied  by  two,  then  added  to  the 
corresponding  k*1st  through  2kth  elements.  The  resulting  k 
element  vector  has  values  0,1, 2, 3,  corresponding  to  the  four 
cells.  The  vector  is  then  used  to  compute  cell  counts. 

b.  Calculation  of  the  Test  Statistic 

Crunch  figures  if  the  cell  counts  of  (1,0)  and 
(0,1)  total  more  than  20.  If  so,  T1  is  computed,  otherwise 


c.  Calculation  of  the  Level  of  Significance 

If  T 1  is  used,  the  subroutine  acnemar  performs 
linear  interpolation  of  tabled  values  of  the  chi-square 
distribution  with  one  degree  of  freedoa.  If  T2  is  used,  the 
binomial  distribution  subroutine  Bintst  is  used  to  calculate 
the  level  of  significance. 

3 .  ££2mi  gU£3£terist^cs 

a.  Limitations 

Maximum  number  of  data  points:  400.  Maximum 
number  of  pairs:  200  if  program  computes  cell  counts;  no 

limit  if  the  user  enters  cell  counts. 

b.  Subro utines/Procedures  Used 

(1)  Lochinvar:  Cel_test,  McNemar 

(2)  Crunch:  McNemr,  Cellar,  Recount 

(3)  External:  Shsort 


XI.  CHI-SOJARE  TEST  FOR  INDEPENDENCE  (R*C  CONTINGENCY) 


A.  OVERVIEW 

1 .  General  Purpose 

Osed  with  a  data  set  composed  of  paired  data 
elements,  to  test  whether  the  classification  or  value  of  the 
first  element  of  the  pair  is  statistically  independent  of 
the  classification  or  value  of  the  second. 

2.  Level  of  Mea  sure men  t 

Nominal 

3.  Assumptions 

a.  The  pairs  is  randomly  drawn  from  the  population. 

b.  The  first  element  of  each  pair  can  be  placed  in 
exactly  one  of  r  categories  (rows) ,  and  the  second  element 
of  each  pair  can  be  placed  in  exactly  one  of  c  categories 
(columns)  . 

4 .  Hypothesis 

a.  Null  hypothesis 

HO:  P(pair  is  in  row  i  and  col  j)  equals  P  (first 

element  in  row  i)  times  P (second  element  in  row  j) 

b.  Alternative  hypothesis 

HI:  P(pair  is  in  row  i  and  col  j)  does  not  equal 

P(first  element  in  rcw  i)  times  P  (second  element  in  row  j) 

B.  ACTION  REQUIRED  BY  USER  OF  PROGRAM 

1 •  User  pecisio  ns  about  Program  Options 

a.  Entry  of  cell  counts 

If  the  data  is  nominal,  the  user  must  decide  the 
categories  of  the  rows  and  columns,  and  enter  the  observed 
cell  counts,  Oij,  for  each  cell.  The  user  may  use  this 
technique  for  data  of  a  higher  level  of  measurement. 


b.  Entry  of  fixed  row  and  column  totals 

If  the  data  is  ordinal  or  above,  and  the  categor¬ 
ization  of  the  data  F°in-ts  is  by  their  order  (e.g.,  not  by 
whether  their  last  digit  is  odd),  the  user  may  enter  Ri's 
and  Cj's.  The  sums  of  the  Ri's  and  Cj's  must  exactly  equal 
the  number  of  pairs  in  the  data  set. 

c.  Entry  of  row  and  column  partition  values 

If  the  data  is  ordinal  or  above,  and  the  categor¬ 
ization  is  based  on  order,  the  user  can  specify  (r-1)  values 
to  divide  the  data  elements  into  r  rows,  and  (c-1)  values  to 
divide  the  data  elements  into  c  columns.  For  example,  a 
data  pair  would  fall  into  cell  (1,2)  if  the  first  element  of 
the  pair  were  less  than  or  equal  to  the  first  row  partition 
value,  and  the  second  element  of  the  pair  were  greater  than 
the  first  column  partiton  value  but  less  than  or  equal  to 
the  second. 

2.  Program  Activation  when  Osi&q  Cell  Counts  (choice  a) 

a.  Initiation  and  test  selection 

In  CHS,  type  LOCHI.  Enter  "H"  for  no  to  question 
about  previously  entered  data.  Enter  "C"  for  cell  counts. 

Enter  "0"  ,"V'', . .  ."Z"  for  desired  output  file,  when  offered  a 

choice  of  tests,  enter  7. 

b.  Cell  count  entry 

Lochinvar  will  ask  the  number  of  rows  and 

columns.  It  will  then  prompt  entry  of  the  observed  cell 

count  for  each  cell. 

3.  Pro  gram  Activation  ysj.i)q  J&ta  y^ues  (choice  b  or  c) 
a.  Initial  Choices 

See  section  III.c  for  a  detailed  explanation  of 
the  following. 

In  CHS,  type  LOCHI.  If  data  is  already  in  a  data 
file,  enter  HY"  to  first  question,  then  enter  file  name; 
when  offered  a  t9St,  type  7.  If  data  needs  to  be  entered, 
enter  "N"  for  first  question,  "7"  for  second;  enter  the 


columns. 


data;  when  offered  a  choice  of  tests,  type  7.  Enter 
"0" , " V" ,  . . .  k  z'»  for  desired  output  file. 

b.  Variable  and  Parameter  Choice 

When  prompted  by  Lochinvar: 

(1)  Enter  indices  of  the  two  variables  to  be 
used  in  test.  Enter  the  number  of  rows  and  the  number  of 
columns.  Enter  whether  using  fixed  row  and  column  totals  or 
partition  values. 

(2)  If  by  fixed  totals,  enter  the  row  totals 
when  prompted,  then  the  column  totals. 

(3)  If  by  partition  value,  enter  the  row  parti-* 
tion  values  when  prompted,  then  the  column  partition  values. 

4.  lest  2xe££t!2£ 

To  perform  the  test,  after  the  above  entry  of  data 
and  parameters,  type  CRUNCH  while  in  CMS. 

5.  Out£j&  Available  to  t^a  fiOE 

The  program  CRUNCH  displays  the  output  file  to  which 
the  data  has  been  sent,  e.g.,  "U  LISTING".  The  data  set  and 
variable  names  are  at  the  top  of  the  output  file.  The  test 
statistic  T,  the  degrees  of  freedom,  and  the  matrix  of 
observed  cell  counts  are  listed. 

C.  ACTIONS  PERFORMED  BY  THE  PROGRAM  CRUNCH 


1 •  Description  of  Computation  of  lest  Statistic 
The  program  Crunch  will: 
a.  Find  the  observed  cell  count 

Specify  the  r  rows  and  c  columns,  thus  creating 
r*c  mutually  exclusive  cells.  Count  the  number  of  pairs 
which  fall  into  the  categorization  of  each  cell:  that  number 
is  the  observed  cell  count.  Oil,  of  cell  ij. 


b.  Find  the  expected  cell  coant 

All  the  cell  counts  in  row  i  are  summed  to  find 
the  row  total  Si.  All  the  cell  counts  in  column  j  are 
summed  to  find  the  cclumn  total  Ci.  The  expected  cell  count 
Eij  is  found  by  multiplying  Si  times  Cj  and  dividing  by  the 
total  number  of  pairs,  N. 

c.  Find  the  test  statistic  T 

For  each  cell,  subtract  Oij  from  Eij.  Square 
each  difference.  Divide  each  squared  difference  by  the 
corresponding  Eij.  Sum  those  quotients  to  find  the  test 
statistic  T,  which  can  be  compared  to  a  chi-square  distribu¬ 
tion  with  (r-1)*(c-1)  degrees  of  freedom.  The  quantile  of 
the  chi-square  distribution  corresponding  to  T  indicates  the 
empirical  confidence  coefficient,  or  the  complement  of  the 
empirical  level  of  significance. 

2.  gygqsaa. 

a.  computation  of  the  cell  counts 

If  the  user  supplied  the  cell  counts,  go  to  step 
b.  If  the  user  gave  fixed  row  and  column  totals,  rank  all 
the  first  elements  to  find  the  row  partition  values,  and 
rank  all  the  second  elements  to  find  the  column  partition 
values.  Compare  the  first  element  of  each  pair  to  the  row 
partitions:  assign  to  each  pair  the  index  of  the  largest 
partition  value  which  the  first  element  is  less  than;  if  the 
first  element  is  larger  than  the  largest  partition  value,  it 
is  assigned  the  integer  r.  Do  the  same  with  the  second 
elements  and  the  column  partition  values.  Each  pair  now  has 
two  integers  assigned,  which  designate  membership  in  a  row 
and  in  a  column.  Count  the  number  of  pairs  falling  in  each 
cell:  this  is  the  observed  cell  count  Oij. 

b.  Compute  the  test  statistic  T 

Find  the  row  and  column  totals  by  summing  oij's. 
Compute  the  expected  cell  counts  Eij  by  ai*Cj/N.  Square  the 
difference  (Oij-Eij)  ,  divide  by  Eij,  sum  over  all  cells. 


3 .  Program  Characteristics 

a.  Limitations 

Maximum  number  of  data  points:  400,  if  the 
program  computes  the  cell  counts;  unlimited  if  the  user 
supplies  cell  counts.  Maximum  number  of  variables/ 
treatments:  2.  Maximum  total  cslls  (r*c)  :  50. 

b.  Subroutines/Procedures  Used 

(1)  Lochinvar:  cel_test,  Rc_cont ;  or  which_test, 
Rc_cont, Settee  11  ,Which_v ec,  Exclud_b 

(2)  Crunch:  Cell er, Party, rc3  or  Tc4, Rcctr , Rcont 

(3)  External:  shsort 
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ni.  CHI -SQD ARE  GOODNESS  OP  FIT  TEST 

A.  OVERVIEW 

1  •  General  fogpose 

With  a  sample  of  15  or  mors  data  elements,  test  if 
that  sample  could  be  drawn  from  a  specified  distribution. 

2.  Level  o£  Measurement 
Interval 

3 .  Assumptions 

The  variable  contains  a  random  sample. 

4 .  Hypothesis 

Where  F*  (x)  is  hypothesised  distribution  function, 
and  F  (x)  is  the  distribution  of  the  population. 

a.  Null  hypothesis 

HO:  F(x)*F*(x)  for  all  l 

b.  Alternative  hypothesis 

HI:  F(x)#F*(x)  for  at  least  one  x 

B.  ACTION  REQOIRED  ET  OSER  OF  PROGRAM 

1 •  Oser  Decisions  about  Program  Options 

The  user  must  decide  among  five  distributions  for 
which  to  test.  Parameters  may  be  specified  by  the  user,  or 
nay  be  estimated  by  the  program  (using  method  of  moments)  . 

2.  Program  Agt i vatlon  wheg  Osiaj  Data  Values 
a.  Initial  Choices 

See  section  IIl.c  for  initial  choice  details. 

In  CHS,  type  L3CHI.  If  data  is  already  in  a  data 
file,  enter  "T"  to  first  question,  then  enter  file  name; 
when  offered  a  test,  type  8.  If  data  needs  to  be  entered, 
enter  "N"  for  first  question,  "V”  for  second;  enter  data. 


and  when  offered  a  choice  of  tests,  type  8.  Enter 
how, MZ"  for  desired  output  file. 

b.  Variable  and  Parameter  Choice 

when  prompted  by  Lochinvar: 

(1)  Enter  indices  of  variables  to  be  used  in 
test.  Enter  blocks  or  pairs  to  be  excluded,  if  any. 

(2)  Enter  theoretical  distribution  .  Choose 
among:  unifori,  normal,  exponential,  erlang  and  viebull. 

(3)  Enter  the  parameters  of  the  theoretical 
distribution,  or  request  that  the  program  estimate  the 
parameters.  In  the  normal  and  exponential  the  method  of 
moments  can  be  invoked  to  allov  the  program  to  estimate  the 
maximum  likelihood  estimates  of  the  parameters.  In  the  case 
of  the  erlang,  the  user  must  specify  the  integer  shape 
parameter  n,  but  can  allov  the  program  to  estimate  the  scale 
parameter  lambda.  In  the  case  of  the  uniform  and  the 
veibull,  the  user  must  specify  the  parameters 

3.  Test  Execution 

To  perform  the  test,  after  the  above  entry  of  data 
and  parameters,  type  CRUNCH  while  in  CHS. 

4.  Output  Available  tg  the  User 

The  program  CRUNCH  displays  the  output  file  to  which 
the  data  has  been  sent,  e.g.,  "U  LISTING".  The  names  of  the 

data  set  and  variables  used  are  at  the  top  of  the  output 

file.  The  test  statistic  T  and  the  degrees  of  freedom  will 
be  displayed.  Finally,  the  matrix  of  cell  counts,  expected 
and  observed,  will  be  given. 

C.  ACTIONS  PERFORMED  BT  THE  PROGRAM  CRUNCH 

1.  Description  of  Calculation  of  Test  Statistic 
a.  Calculation  of  the  Expected  Cell  Counts 

Let  k  equal  the  number  of  data  elements.  If 

15Sk<»50,  divide  k  by  5  and  round  to  the  nearest  integer. 


This  will  be  C ,  the  number  of  cells;  e.g.,  k=  1 7  implies  C=3, 
k*18  implies  C=4.  In  all  cells  except  perhaps  the  first  and 
last  the  expected  cell  count  will  be  5.  If  C*5=k,  then  the 
expected  cell  counts  for  all  cells  is  5.  If  C*5=k+1,  rhe 
expected  cell  count  of  the  first  call  is  4;  if  C*5=k+2,  the 
expected  cell  count  of  the  first  and  last  cells  is  4.  If 
C*5=k-1,  the  expected  cell  count  of  the  first  cell  is  6;  if 
C*5=k-2,  the  expected  cell  count  of  the  first  and  last  cells 
is  6.  For  example,  if  k=17,  C=3;  since  3*5=17-2,  the  first 
and  last  cells  have  expected  counts  of  6,  i.e.,  the  expected 
cell  counts  are  6,5,6. 

If  k>50,  there  will  be  10  calls.  Divide  k  by  10 
and  truncate;  that  will  be  the  minimum  expected  cell  count 
E.  The  remainder  of  the  truncation  is  rem=k-10*E;  add  1  to 
the  cell  count  of  the  last  rem  cells. 

The  expected  cell  counts  are  stored  in  the 
vector  EJ.  This  is  the  building  block  of  the  F=P  (X<x) 
values  of  the  theoretical  distribution.  Scan  the  EJ  vector 
to  find  the  CUN  vector,  i.e.,  cun  (1 )  =EJ  ( 1)  , 
CUN  ( 2)  s E J  ( 1 )  +E  J  ( 2)  ,  ...»  CUH(j)=sum  of  EJ  (i)  ,  i=1,..,j. 

Divide  each  element  of  the  CUN  vector  by  the  number  of 
elements,  k,  to  fora  the  CUHP  vector. 

b.  The  Partition  Values 

The  usual  form  of  theoretical  distributions  takes 
a  value  x  and  returns  a  value  F,  such  that  F  *  P  (X<x)  .  In 
this  case,  the  inverse  form  of  the  theoretical  distributions 
is  used  to  take  an  F,  contained  in  the  CUNP  vector,  and 
return  an  x.  The  first  (C-1)  values  returned  form  the 
partition  values  which  will  divide  those  data  elements  info 
cells. 

c.  The  Observed  Cell  Count 

The  data  elements  are  compared  to  the  partition 
values  contained  in  the  vector  PARTV.  An  element  falls  in 
the  ith  cell  if  it  is  larger  than  PARTV(i-l)  but  less  than 


or  equal  to  PARTY(i).  Elements  less  than  PARTY  (1)  are  in 
the  first  cell;  elements  larger  than  PARTY  (C-1)  are  in  the 
last  cell.  The  observed  cell  counts  are  contained  in  the 
vector  RCCT. 

d.  The  Test  Statistic 

For  each  cell,  subtract  EJ  (i)  from  RCCT(i)  , 
square  that  difference,  and  divide  that  square  by  EJ(i). 
Sub  over  all  cells.  This  is  the  test  statistic  T,  with 
degrees  of  freedom  (C-1)  . 

2.  Program  Method  of  0  Deration 

The  subroutine  Gdfit  calls  the  subroutine  Muvar  to 
find  the  sample  mean  and  variance  if  parameters  must  be 
estimated.  The  subroutine  Cpvgen  figure  the  EJ,  CUM,  and 
COMP  vectors.  The  vector  PARTY  is  figured  in  one  of  five 
subroutines,  depending  on  the  theoretical  distribution  spec¬ 
ified;  unid,  normd,  expd,  erld,  weid.  The  observed  cell 
counts  RCCT  are  found  in  Celcnt.  The  test  statistic  T  is 
computed  in  the  subroutine  Gdfit. 

3 •  Program  Characteristics 

a.  Limitations 

Maximum  number  of  data  points:  400.  Maximum 
number  of  variable:  1. 

b.  Subroutines/Procedures  Used 

(1)  Lochinvar:  Gd_fit_n,  Gd_fit_e,  Gd_fit_u, 

Gd_f it_er ,  Gd_fit_w,  Gd_fit 

(2)  Crunch:  Gdfit,  Cpvgen,  Muvar,  Celcnt,  Unid, 
Normd,  Expd,  Erld,  ieid 

(3)  External:  Shsort 


XIII.  WILCOXO  N  SIGHED  RANK  TEST 

A.  OVERVIEW 

1 .  General  Purpose 

With  two  paired  variables  (X,Y),  the  signed  rank 
test  determines  whether  the  expected  values  of  populations  X 
and  Y  could  be  the  same. 

This  requires  interval  data.  If  the  data  is  only 
ordinal,  use  the  sign  test. 

2.  Level  o£  Mea  sura  men  t 
Interval 

3 .  Assumptions 

The  pairs  (Xi,Yi)  are  random  samples  from  the  popu¬ 
lation  of  all  pairs. 

4 .  Hypotheses 

a.  Two  tailed  test 

(1)  Null  hypothesis  hO:  E(X)=E(Y) 

(2)  Alternative  hypothesis  his  E(X)*E(Y) 

b.  One  tailed  test 

(1)  Null  hypothesis  h3:E(X)<E(Y) 

(2)  Alternative  hypothesis  hi:  E  (X)  >E  (Y) 

c.  One  tailed  test 

(1)  Null  hypothesis  h3:  E(X)>E(Y) 

(2)  Alternative  hypothesis  hi:  E  (X)  <E  (Y) 

B.  ACTION  REQUIRED  EY  USER  OF  PROGRAM 

1 .  User  Decisions  about  Program  options 

To  make  the  test  more  flexible,  a  constant  term  can 
be  added  to  the  higher  indexed  variable,  so  that  the  test 
becomes  a  question  of  the  relationship  of  E(X)  and  E(Y)«-b. 


By  default,  the  value  of  b  is  0;  but  the  user  car.  specify 
any  real  number. 

Lochiavar  also  asks  for  the  hypothesis  type  and 
level  of  significance. 

2.  Program  Activation  when  Using  Data  Values 
a.  Initial  Choices 

See  section  III.C  for  a  detailed  explanation  of 
the  following. 

In  CMS,  type  LOCHI.  If  data  is  already  in  a  data 
file,  enter  "Y"  to  first  question,  then  enter  file  name; 
when  offered  a  test,  type  9.  If  data  needs  to  be  entered, 
enter  "N"  for  first  question,  ••  V **  for  second;  when  data 
entered,  choose  to  do  a  test;  when  offered  a  choice  of 
tests,  type  9.  Enter  "  UM, '»V", . . .  "Z"  for  desired  output 


file. 

b.  Variable  and  Parameter  Choice 

When  prompted  by  Lochiavar: 

(1)  Enter  indices  of  variables  to  be  used  in 
test.  Enter  blocks  or  pairs  to  be  excluded,  if  any. 

(2)  Enter  whether  a  constant  term  is  to  be  added 
to  the  value  of  the  higher  indexed  variable,  Y.  By  default, 
the  constant  is  0. 

(3)  Enter  the  letter  of  the  type  of  hypothesis 
and  the  hypothesised  level  of  significance. 

4 •  Te?t  Execution 

To  perform  the  test,  after  the  above  entry  of  data 
and  parameters,  type  CRUNCH  while  in  CMS. 

5.  Output  Available  to  the  User 

The  program  CRUNCH  displays  the  output  file  to  which 
the  data  has  been  sent,  e.g.,  "U  LISTING".  The  names  of  the 
data  set  and  variables  used  are  at  the  top  of  the  output 
file. 

Crunch  will  provide  the  sum  of  negative  ranks  and 
sum  of  positive  ranks,  as  well  as  the  test  statistic  T  in 


all  cases.  If  there  are  fewer  than  2D  pairs,  table  look  up 
is  required;  otherwise,  the  program  will  compute  the  empir¬ 
ical  level  of  significance. 

When  the  number  of  untied  pairs  is  less  than  20,  the 
output  will  also  contain  the  vector  Dif,  the  differences 
Yi-Xi,  in  ranked  order. 


C.  ACTIONS  PERFORMED  BY  THE  PROGRAM  CRUNCH 


Description  of  Cal culat ion  of  Test  Statistic/Level 


of  Sicni 


Rank  the  values  of 


a.  Sums  of  the  Negative  and  of  the  Positive  Ranks 

Two  sums  are  calculated,  one  for  the  negative 

differences—  those  pairs  in  which  Xi>Yi— and  one  for  the 
positive  differences.  Pairs  in  which  Xi=Yi  are  not  consid¬ 
ered.  Subtract  Xi  from  Yi ,  to  form  the  vector  Dif.  Take 
the  absolute  value  of  Dif  to  form  Adif.  Rank  the  values  of 
Adif  tc  form  the  vector  Rank.  If  Yi<xi,  the  ith  value  of 
Rank  is  added  to  the  sum  of  negative  ranks,  then  that  ith 
value  is  multiplied  by  negative  1.  If  Yj>Xj,  the  jth  value 
of  Rank  is  added  to  the  sum  of  positive  ranks. 

b.  Computation  of  Test  Statistic 

Tha  vector  Rank  was  originally  the  ranks  of  the 
absolute  differences;  now,  the  ranks  for  the  negative 
differences  have  been  multiplied  by  -1.  Sum  the  vector  Rank 
to  find  the  numerator  of  the  test  statistic.  Sum  the 
squares  of  the  elements  of  Rank  to  find  the  denominator  of 
the  test  statistic.  Divide  numerator  by  denominator. 

c.  Level  of  Significance 

If  the  number  of  untied  pairs  is  20  or  less,  use 
table  4-13  *  ref  1],  using  the  sum  of  positive  ranks. 
Otherwise,  Crunch  computes  an  approximation,  using  tha  test 
statistic  T  and  the  standard  normal  distribution. 


2.  Program  Met  hod  of  0  per  at  ion 

The  subroutine  Wilcox  finds  the  Dif  and  Adif 
vectors,  and  calls  the  subroutine  Ranque  (which  in  turn 
calls  Rnq)  to  find  the  vector  Ranh.  Wilcox  computes  the 
sums  of  the  positive  and  negative  ranks,  as  well  as  the  test 
statistic  T.  If  the  number  of  untied  pairs  is  20  or  less, 
it  will  display  the  sums  of  ranks.  Otherwise,  it  will  call 
the  subroutine  Normv  to  find  the  approximate  empirical  level 
of  significance,  and  display  whether  to  reject  the  null 
hypothesis. 

3 .  Program  Characteristics 

a.  Limitations 

Maximum  number  of  data  points:  400.  Maximum 
number  of  variables:  2;  maximum  number  of  pairs:  200. 

b.  Subroutines/Procedures  Used 

(1)  Lochinvar:  wilcoxon 

(2)  Crunch:  Wilcox, Ranque, Rnq, Normv 

(3)  External:  Shsort 


XIV.  NONPAR A  METRIC  CORRELATION 

A.  OVERVIEW 

1 .  General  Purpose 

With  a  pairs  cf  data  elements,  X  and  Y,  test  whether 
the  value  of  X  is  statistically  independent  of  the  value  of 
I,  or  if  it  varies  directly  or  inversely  with  Y. 

Three  different  test  statistics  are  available  by 
running  this  one  program:  Pearson's  rho,  Spearman's  rho,  and 
Kendall's  tau. 

2.  Level  of.  gsasuremeg t 

Interval  for  the  Pearson's  rho  to  have  meaning. 
Ordinal  for  Spearman's  rho  and  Kendall’s  tau. 

3 .  Assumptions 

X,T  are  a  random  sample  of  population  pairs. 

4 .  Hypothesis 

Where  rho  is  a  measure  of  correlation, -1<rho<=1 

a.  Null  hypothesis 

HO:  rho(X,Y)  =  0,  a  necessary  condition  for 

statistical  independence 

b.  Alternative  hypothesis 
HI:  rho  (X , Y)  *  0 

B.  ACTION  REQUIRED  EY  USER  OF  PROGRAM 

1 .  User  Decisions  about  Prooraa  Options 

The  user  chooses  which  pair  of  variables  from  a  data 
set  are  used  in  a  test,  and  whether  to  exclude  any  of  the 
pairs  from  the  test.  As  described  below,  the  computation  of 
Kendall's  taa  uses  an  algorithm  of  complexity  level 
n-squared  (when  comparing  every  pair  with  every  other  pair); 


if  there  are  more  than  40  pairs  Lochinvar  offers  the  user 
the  chance  to  skip  the  Kendall's  tau  test.  If  so,  only 
Spearman's  rho  and  Pearson's  rho  (which  may  have  no  meaning 
if  the  data  is  ordinal)  will  be  computed.  Otherwise,  all 
three  will  be  produced. 

2.  Pro era a  Act ivation  when  Usi^g  Data  Values 

a.  Initial  Choices 

See  section  III.C  for  a  detailed  explanation  of 
the  following. 

In  CHS,  type  LOCHI.  If  data  is  already  in  a  data 
file,  enter  "Y"  to  first  question,  then  enter  file  name; 
when  offered  a  test,  type  10.  If  data  needs  to  be  entered, 
enter  "N"  for  first  question,  "VM  for  second;  when  data 
entered,  choose  to  do  a  test;  when  offered  a  choice  of 
tests,  type  10.  Enter  "  U", "V",. . .  "Z"  for  desired  output 
file. 

b.  Variable  and  Parameter  Choice 

When  prompted  by  Lochinvar: 

(1)  If  there  are  aore  than  two  variables  in  the 
data  set,  enter  the  indices  of  the  two  to  be  used.  Enter 
pairs  to  be  excluded,  if  any. 

(2)  If  there  are  more  than  forty  pairs,  enter 
whether  the  Kendall's  tau  should  be  computed. 

4 .  Test  E  xecuticn 

To  perform  the  test,  after  the  above  entry  of  data 
and  parameters,  type  CRUNCH  while  in  CMS. 

5-  Output  Available  t£  the  User 

The  program  CRUNCH  displays  the  output  file  to  which 
the  data  has  been  sent,  e.g.,  "U  LISTING".  The  names  of  the 
data  set  and  variables  used  are  at  zhe  top  of  the  output 
file.  Spearman’s  rhe,  Pearson's  rho,  and  Kendall's  tau  are 
listed. 


C.  ACTIONS  PERFORMED  BY  THE  PROGRAM  CRUNCH 


1 .  Description  of  Calculation  of  Test  Statistics 

a.  Pearson's  Rho 

Pearson's  rho  is  SXY  divided  by  square  root  of 
SXX  times  SYY.  The  deviations  of  X  are  found  by  subtracting 
the  mean  of  X  from  each  value  of  X;  deviations  cf  Y  are  Yi 
minus  mean  of  Y.  The  product  of  deviation  of  Xi  times  devi¬ 
ation  of  Yi,  summed  over  all  pairs,  is  SXY.  The  sum  of 
squared  deviations  of  Xi  is  SXX;  the  sum  of  squared  devia¬ 
tions  of  Yi  is  SYY. 

b.  Spearman*  s  Rho 

Rank  the  values  of  X.  Rank  the  values  of  Y. 
Those  pairs  of  ranks  (R(Xi)  , R  ( Yi) )  ,  are  then  used  to  compute 
SXY  divided  by  square  root  of  SXX  times  SYY.  That  is, 
Spearman's  rho  is  Pearson's  rho  performed  on  the  ranks  of 
the  values  rather  than  the  values  themselves. 

c.  Kendall's  Tau 

Tau  is  roughly  the  measure  of  the  number  of 
concordant  pairs  minus  disc oncordant  pairs  divided  by  total 
number  of  possible  pairing.  Concordance  implies  positive 
correlation;  e.g. ,  if  pair  of  the  data  set  number  4  (X4,Y4) 
has  an  X  value  greater  than,  say,  X5,  and  a  Y  value  greater 
than  Y5,  then  this  is  one  small  indication  that  there  is 
positive  correlation  between  x  and  Y.  Therefore,  the  4th 
and  5th  pairs  of  the  data  set  form  to  make  one  concordant 
pair.  If  X6  is  greater  than  X4,  but  Y6  is  less  than  X4, 
this  is  a  little  evidence  of  negative  correlation;  the  4th 
and  6th  pairs  of  the  set  form  to  make  one  disconcordant 
pair.  Ties  in  either  X  or  Y  mean  that  the  comparison  yields 
neither  concordance  nor  disconcordaace. 

To  compute  tau,  order  the  pairs  (Xi,Yi)  in 
ascending  order  of  X's.  Compare  the  first  pair  (the  one  with 
the  lowest  X  value)  to  every  other  pair.  If  XI  is  tied  with 


Xi  or  Y 1  with  Yi,  go  to  the  next  pair;  if  not,  (Y1  <  Yi) 
implies  concordance,  (YI  >  Yi)  implies  disconcor  dance; 
increment  the  relevant  count.  After  the  first  pair  is 
compared  with  all  others,  the  second  pair  is  compared  with 
all  succeeding  pairs  (3,4,...,k),  incrementing  counts  as 
necessary.  Each  pair  is  compared  to  all  succeeding  pairs, 
until  Xk-1  is  compared  to  Xk.  Kendall’s  tau  equals  the 
number  of  concordant  pairs  minus  the  number  of  disconcordant 
pairs,  all  divided  by  the  total  possible  number  of  pairings 
(k(k-1)/2).  If  all  pairings  are  concordant,  tau  will  equal 
1;  all  disconcordant  pairings  will  yield  an  -1. 

2.  grama  i3a£.k°i  °£  a  paction 

a.  Pearson's  and  Spearman's  Rhos 

The  subroutine  Rhoer  calculates  SXY,  SXX,  SYY  for 
any  input  values,  and  returns — inter  alia — rho.  For 

Pearson's  rho  use  the  values  of  the  data  set.  For 

Spearman's  rho,  use  subroutine  Rnq  to  rank  the  values  of  the 

data  set,  and  send  these  ranks  to  Rhoer. 

b.  Kendall's  Tau 

Since  the  values  have  already  been  ranked,  use 
the  indices  of  ranked  X  to  erder  the  pairs  (R  (Xi)  ,R  (Yi) )  . 
Ranks  work  the  same  as  values  for  finding  concordance  or 
disconcordarca,  but  ties  are  easier  to  discern  with  ranks. 
Hith  nested  do  for  loops,  compare  the  ith  pair  (i  =*  1  to 

k-1)  with  pairs  j  *  i+1  to  k;  R(Xi)  =  R(Xj)  or  R  (Yi)  »  R(Yj) 

means  to  continue  with  looping,  otherwise,  if  R(Yi)  <  R(Yj) 
then  increment  concordant  counter,  if  R(Yi)  >  R(Yj)  then 
increment  disconcordant  counter.  At  the  end  of  the  nested 
loops,  divide  the  difference  of  the  concordant  counter  minus 
the  disconcordant  counter  by  the  term  (k(k-1)/2). 

3 •  Program  Characteristics 

a.  Limitations 

Maximum  number  of  data  points;  400.  Maximum 
number  of  pairs:  200. 

b.  Subroutines/Procedures  Used 


XV .  NONPAR  A  METRIC  REGRESSION 

A.  OVERVIEW 

1 .  General  Pprpose 

Regression  tests  the  effect  of  the  value  of  indepen¬ 
dent  variable  X  has  on  the  value  of  the  independent  variable 
T.  The  linear  regression  model  assumes  that  the  effect  of  X 
on  Y  is  linear,  that  is,  E  ( Y |  X)  =E  (Y)  ♦  bl  (X-E  (X) )  .  This  test: 

(1)  Finds  estimates  of  bl. 

(2)  Predicts  values  of  Y  given  values  of  X 

(3)  Test  if  the  linear  model  is  correct 

(4)  Finds  confidence  limits  for  the  value  of  bl 

2.  level  of  Measurement 

Ordinal,  to  test  the  linear  model 
Interval,  for  the  other  purposes  listed  above 

3.  Assumptions 

a.  Random  sampling  of  pairs.  X  may  be  specified,  as 
long  as  the  Y  portion  of  the  pair,  given  X,  is  independent. 

b.  The  linear  model  is  the  correct  model. 

4  •  Hypothesis 

Where  bl  is  slope  the  parameter;  bl*  hypothesized. 

a.  Null  hypothesis 
HO:  b  1*b  1  * 

b.  Alternative  hypothesis 
HI:  b1*b1  * 

B.  ACTION  REQUIRED  EY  OSER  OF  PROGRAM 

1.  User  Decisions  j,bout  Prooraa  Options 

a.  The  user  chooses  the  '-■* o  variables  to  be  used, 
and  designates  which  one  is  dependent.  The  program  will 
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automatically  fit  the  least  squares  model,  finding  bl.  The 
intercept  parameter  bO,  2  (Y  |  X)  =bO+b1*X,  is  also  computed. 

b.  The  user  can  input  values  for  X  to  predict  values 
of  I  based  on  the  least  squares  parameters. 

c.  The  user  can  specify  a  hypothesised  bl*  to  test 
whether  a  linear  model  is  appropriate.  The  default  value  is 
bl*  =  0;  this  is  the  classical  test  of  whether  an  indepen¬ 
dent  variable  predicts  a  dependent  variable. 

d.  The  user  can  request  confidence  intervals  on  the 
slope  parameter  bl.  As  described  below,  the  algorithm  to 
compute  .  those  intervals  is  of  degree  of  complexity  n 
squared,  that  is,  for  n  pairs(x,y),  n(n-1)/2  comparisons 
must  be  made  and  stored  to  find  the  confidence  intervals. 
For  larger  values  of  n,  the  user  should  consider  not 
requesting  this  option. 

If  the  user  does  choose  to  have  the  confidence 
intervals  computed,  the  value  W  will  be  requested  by 
Lochinvar.  That  value  is  explained  by  Conover  [Ref.  1:  p. 
267  ],  and  contained  in  table  A-12  of  that  reference. 

2.  Program  Activation  when  Psing  Data  Values 

a.  Initial  Choices 

See  section  III.C  for  a  detailed  explanation  of 
the  following. 

In  CHS,  type  LQCHI.  If  data  is  already  in  a  data 
file,  enter  "Y"  to  first  question,  then  enter  file  name; 
when  offered  a  test,  type  11.  If  data  needs  zo  be  entered, 
enter  "N"  for  first  question,  "V"  for  second;  when  data 
entered,  choose  to  do  a  test;  when  offered  a  choice  of 
tests,  type  11.  Enter  "O",11?",...  "Z"  for  desired  output 
file. 

b.  Variable  and  Parameter  Choice 
When  prompted  by  Lochinvar; 

(1)  Enter  indices  of  variables  to  be  used  in 
Enter  pairs  to  be  excluded,  if  any. 


test . 


(2)  Enter  values  cf  X  to  compute  E ( Y |  X)  based  on 
least  squares  linear  fit. 

(3)  Enter  hypothesized  slope  parameter  bl*. 

(4)  Enter  whether  confidence  limits  are  desired. 

3 •  Execution 

To  perform  the  test,  after  the  above  entry  of  data 
and  parameters,  type  CRUNCH  while  in  CHS. 

4.  Output  A  vail  able  to  the  User 

The  program  CRUNCH  displays  the  output  file  to  which 
the  data  has  been  sent,  e.g.,  "U  LISTING".  The  names  of  the 
data  set  and  variables  used  are  at  the  top  of  the  output 
file.  The  least  squares  fit  parameters  will  be  listed,  and 
the  Spearman*s  rho  statistic  for  the  specified  value  of  bl . 
If  the  user  requested  expected  values  of  ¥  for  some  values 
of  X,  the  values  of  X  and  Y  will  be  listed.  Finally,  if 
confidence  bounds  were  requested,  those  will  be  given. 

C.  ACTIONS  PERFORHED  BY  ‘THE  PROGRAM  CRUNCH 

1 .  Description  of  Calculation  of  Test  Statistic/ Level 

siaailisaace 

a.  Find  the  Regression  Parameters  b1,b0 

This  step  is  identical  to  parametric  simple 
regression.  Find  the  means  of  X  and  Y,  and  the  vectors  of 
deviations  (Xi-mean  of  X),  (Yi-mean  of  Y) .  Multiply  the 
corresponding  deviations  of  X  and  Y  together,  and  sum  the 
products  to  fora  SXY.  Square  the  deviations  of  x,  then  sum 
those  squares,  to  form  SXX.  Sum  the  squared  deviations  of  Y 
to  find  SYY.  The  slope  parameter  bl  is  SXY/SXX.  The  inter¬ 
cept  parameter  bO  is  the  mean  of  Y  minus  bl  times  the  mean 
of  X.  The  expected  value  of  Y  given  X  is  bO  plus  bl  times 
X.  Residuals  are  the  difference  between  the  expected  and 
actual  value  of  Y. 
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b.  Test  the  Value  of  X  as  a  Predictor  of  Y 

In  parametric  regression,  it  is  standard  to  rest 
whether  the  slope  parameter  bl  could  be  0  in  the  entire 
population.  In  this  package,  any  value  bl*  can  be  set  as 
the  null  hypothesis.  The  test  checks  to  see  if  the  resi¬ 
duals  of  the  fit  using  bl*  are  correlated  with  the  values  of 
X;  if  not,  then  the  fit  is  considered  adequate.  The  resi¬ 
duals  are  (Yi-b1*xi),  the  vector  of  residuals  and  the  inde¬ 
pendent  variable  X  are  com  paired  using  the  Spearman's  rho 
test.  See  section  XIV  for  details  of  that  test. 

c.  Predictions  of  Y  based  on  Least  Squares  Fit 

With  the  least  squares  parameters  bO  and  bl 
already  computed.  Crunch  multiplies  each  given  X  times  bl, 
adds  bO,  and  displays  the  predicted  Y. 

d.  Confidence  Bounds  for  the  Slope  Parameter  bl 

Form  the  vector  of  slopes  S.  S(1)  is  (X(1)-X(2)) 

divided  by  (Y(  1)  -Y  (2) )  ;  S  (2)  is  (X(1)  -X  (3)  ) /(Y  (1) -Y  (3) )  ,  and 
so  on.  With  k  pairs  (X,Y),  there  are  N=k(k-1)/2  elements  of 
the  slope  vector,  when  comparing  each  i  with  each  other  j 
once. 

Sort  the  vector  S.  The  indices  of  the  confidence 
limits  are  r  (0.5*(N-W))  and  s  (0.5*  (N*W)  +1)  ,  where  N  is  the 
total  number  of  slopes  and  W  is  the  user  entered  value  from 
table  A- 12  of  Conover.  If  necessary,  round  r  or  s  up  to 
integers.  The  lower  limit  on  bl  is  s  (r)  ,  the  upper  limit  is 
S  (s)  . 

2.  Program  flat  hod  o£  Operation 

The  subroutine  Regret  calls  the  subroutine  Rhoer  to 
find  the  least  squares  parameters  bl  and  bO.  Regret  figures 
the  residuals  for  the  specified  bl*,  and  sends  the  residuals 
to  subroutine  Correl  to  have  Spearman's  rho  calculated. 
Regret  figures  the  vector  of  slopes  S;  that  vector  is  sorted 
by  Shsort;  Regret  finds  indices  s  and  r  and  the  confidence 
limits  to  bl. 
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Maxi  mum 


3.  Proara  a  Cha  racteri sties 

a.  Lis  it  at  ions 

Maximum  number  of  data  points:  400. 
number  of  variables:  2. 

b.  Subroutines/Procedures  Used 


(1)  Locbinvar:  Regret 

(2)  Crunch:  Regret,  Rhoer,  Correl 

(3)  External:  Shsort 


XVI .  MQNDTONIC  REGRESSION 


A.  OVERVIEW 

1 .  general  Purpose 

With  independent  variable  X,  and  dependent  variable 
Y,  find  the  E(Y|X)  equation,  based  on  a  least  squares  fit  of 
the  ranks  of  X  and  Y .  Linear  interpolation  finds  values  from 
X  to  E(rank  of  X)  to  E(rank  of  Y)  to  E(Y). 

2 .  Level  of  Measurement 

Interval 

3 .  Assumptions 

a.  The  sample  is  a  random  sample 

b.  The  regression  of  Y  on  X  is  monotonic. 

4 .  Hypothesis 

This  test  does  not  produce  a  test  statistic;  the 
assumption  is  that  the  monotonic  model  is  correct.  Instead, 
it  will  predict  expected  values  of  Y  for  given  values  of  X, 
Also,  it  can  produce  a  trace  of  the  monotone  regression, 
which  is  analogous  to  the  straight  line  of  the  least  squares 
fit  of  linear  regression. 

B.  ACTION  REQUIRED  EY  OSER  OP  PROGRAM 

1.  Oser  Pecisio ns  about  Program  Potions 

The  user  must  specify  the  index  for  the  independent 
value  X,  and  dependent  variable  Y.  The  user  can  include 
values  of  X  to  have  predicted  a  corresponding  value  of  Y. 
The  user  can  request  a  trace  of  the  monotone  regression. 

2.  Program  Activation 

a.  Initial  Choices 

See  section  III.C  for  initial  choice  details. 
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In  CMS,  type  LOCHI.  If  dara  is  already  in  a  data 
file,  enter  "  Y  "  to  first  question,  then  enter  file  name; 
when  offered  a  test,  type  12.  If  data  needs  to  be  entered, 
enter  "N"  for  first  question,  "V"  for  second;  when  data 
entered,  choose  to  do  a  test;  whan  offered  a  choice  of 
tests,  type  12.  Enter  " 0", "V", . . . "Z"  for  desired  output 
file. 

b.  Variable  and  Parameter  Choice 

When  prompted  by  Lochinvar: 

(1)  Enter  indices  of  variables  X  and  Y.  Enter 
pairs  to  be  excluded,  if  any. 

(2)  Enter  whether  to  predict  values  of  Y  for 
specified  values  of  X.  If  yes,  enter  values  of  X. 

(3)  Enter  whether  a  trace  is  desired. 

3 .  Test  Execution 

To  perform  the  test,  after  the  above  entry  of  data 
and  parameters,  type  CRUNCH  while  in  CMS. 

4.  Qutput  Available  to  the  User 

The  program  CRUNCH  displays  the  output  file  to  which 
the  data  has  been  sent,  e.g.,  "U  LISTING".  The  names  of  the 
data  set  and  variables  used  are  at  the  top  of  the  output 
file.  In  all  cases,  there  will  be  listed  the  parameters  bO 

and  bl,  the  intercept  and  slope  of  the  least  squares  fit  on 

the  ranks  of  X  and  Y.  If  the  user  specified  X*s  to  predict 

Y's,  the  predicted  values  will  be  listed.  Finally,  if  a 

trace  was  requested,  the  pairs  of  X’s  and  Y’s  will  be  given. 

C.  ACTIONS  PERFORMED  BY  THE  PROGRAM  CRUNCH 

1  •  Descript  ion  of  Calculations 

a.  Lea s t  Squares  Fit  on  Ranks  of  X  and  Y 

Sort  and  rank  X  and  Y.  With  n  pairs,  X  (i)  and 
Y(j)  are  the  order  statistics,  RX(i)  and  RY(j)  are  the 
ranks,  i*1,...,n,  h  =  1 , . . . , a .  Compute  the  least  squares  fit 


to  find  bO  and  bl,  such  that  E  (RY  |  RX)  =bQ+b  1  *RX.  See  section 
XV  for  details  of  least  square  computations. 

b.  Predication  of  Y  Based  on  Values  of  X 

Linear  regression  will  extrapolate  values  of  Y 
for  a  given  value  of  X  if  that  X  is  outside  the  range  of  the 
observed  values  X(1),  ...,  X(n).  Monotone  regression  will 

not  extrapolate;  if  the  user  specifies  an  X  outside  the 
range  of  the  observations,  that  X  will  not  yield  an  expected 
Y. 

Let  Xp  be  one  of  the  values  the  user  specified  to 
predict  a  Y,  Yp.  Find  the  pair  of  values  of  the  order 
statistics  of  X  which  bracket  Xp,  X  (i)  <Xp<=X  (i+ 1 )  .  The 
expected  rank  of  Xp,  HXp,  must  fall  between  those  order 
statistics'  ranks,  R X  (i)  <RX p<=RX  (i*  1 )  .  Use  linear  interpo¬ 
lation  to  find  RXp ;  RXp=  RX(i)  (Xp-X  (i)  ) / (X  (i+ 1) -X (i) ) 
*(RX  (i+1)  -RX(i) )  . 

Use  the  parameters  bO  and  bl  -to  find  RYp,  the 
rank  of  Yp:  RYp=bO  +  b1*RXp. 

Find  the  ranks  of  Y  which  bracket  RYp, 
RY  ( j )  <RYp<=  RY  ( j  +  1)  .  The  value  of  Yp  lias  between  Y(j)  and 
Y(j+1).  Use  linear  interpolation  to  find  Yp:  Yp=  Y(j)  * 
(RYp-RY(j))/(RY  (j+1)~RY(j))  *  (Y  ( j  + 1 )  -  Y  ( j )  )  . 

c.  Trace  of  Monotone  Regression 

The  trace  is  a  set  of  lines,  between  a  sequence 
of  points  (Xi,Yi),  which  describe  the  monotone  regression. 
When  plotted,  the  trace  can  be  used  to  predict  a  value  of  Y 
for  an  X.  This  is  similar  to  using  the  straight  line  of 
linear  regression,  Y=bO  ♦  b1*X,  for  prediction. 

To  find  the  endpoints  of  the  trace,  use  X ( 1 )  and 
X(n)  ,  and  the  procedure  of  part  b,  to  find  E(Y|X(1})  and 
E  (Y  |  X  (n)  )  . 

Intermediate  points  of  the  trace  will  use  the 
values  of  Y  to  predict  expected  X.  Each  value  of  Y  has  a 
rank,  RY,  which  is  used  with  bO  and  bl  to  find  RXt  (trace). 
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RXt=  (RY-fcO)  /b  1 .  For  each  RXt,  find  the  pair  of  RX's  which 
bracket  it.  Use  linear  interpolation  to  find  Xt. 

2.  Program  Method  of  Q  peration 

The  subroutine  Monore  calls  Rnq  to  rank  and  sort 
values.  The  subroutine  Correl  finds  the  least  squares 
parameters  bO  and  bl  from  the  rank  vectors  RX  and  RY.  The 
subroutine  ESTY  performs  estimations  of  Y  for  given  X's, 
calling  Rtov  (which  calls  Interp)  to  do  the  linear  interpo¬ 
lations.  The  subroutine  Trace  performs  the  trace,  calling 
Vror,  Rtov  and  Interp. 

3 .  Program  Characteristics 

a.  Limitations 

Maximum  number  of  data  points:  400.  Maximum 
number  of  variables:  2. 

b.  Subroutines/Procedures  Used 

(1)  Lochinvar:  Monoreg 

(2)  Crunch:  Monore,  Rtov,  Vtor,  Esty,  Interp,  Rng 

(3)  External:  Shsort 


XXVII.  MEDIAN  JEST 


A.  OVERVIEW 

1  •  General  Purpose 

With  two  or  mors  samples,  this  test  will  determine 
if  all  the  populations  from  whinh  the  samples  were  drawn 
could  have  the  same  median. 

As  described  below,  the  package  allows  combining  of 
data  set  variables  into  composite  variables  in  order  to  test 
population  groups  for  common  median. 

The  median  may  be  a  surrogate  for  expected  value. 

2.  Level  of  Mea  sure  men  t 

Ordinal 

3 .  Assumptions 

The  variable  were  randomly  drawn  from  the  respective 
populations,  and  are  mutually  independent. 

4  •  Hypothesis 

With  k  populations,  X  ( 1)  ,  X  (2)  , . . . ,  X  (k)  ,  and  the 

median  denoted  as  M, 

a.  Null  hypothesis 

HO:  M  (XI)  =M(X2)  =.  ..=M  (Xk) 

b.  Alternative  hypothesis 

HI:  M(Xi)#M(Xj)  for  at  least  one  of  the  pairs 

B.  ACTION  REQUIRED  ET  USER  OF  PROGRAM 

1 •  Oser  Decisions  about  Proaraa  Options 

The  user  chooses  which  variables  of  the  data  set  to 
include  in  the  test. 

Aggregate  variables  may  be  formed  by  having  Crunch 
combine  the  expected  cell  counts  of  several  variables  into 


one  or  more  aggregate  variables.  For  example,  let  data  set 
Farmland  contain  six  variable,  in  order,  A  through  F. 
Variable  A  is  corn  yield  per  acre  of  farm  land  treated  with 
fertilizer  and  pesticide.  B  is  corn  yield  with  fertilizer 
but  no  pesticide.  C  is  corn  yield  with  pesticide  but  no 
fertilizer.  D  is  corn  yield  per  acre  from  land  untreated  by 
either  fertilizer  or  pesticide.  Variables  E  and  F  have 
other  values.  Variables  A  and  C  could  be  combined  into 
aggregate  variable  1,  and  B  and  D  into  2,  to  test  whether 
the  median  yield  with  pesticide  is  sgual  to  the  median  yield 
without  pesticide.  Similarly,  A  and  B,  and  C  and  D,  could 
be  combined  into  aggregate  variables  1  and  2,  to  test  the 
median  of  yields  with  and  without  fertilizer. 

Proper  planning  and  labeling  of  the  variables  will 
facility  this  aggregation. 

2.  Pro gram  Activation  when  Osing  Data  Values 

a.  Initial  Choices 

See  section  III.C  for  a  detailed  explanation  of 
the  following. 

In  CMS,  type  LOCHI.  If  data  is  already  in  a  data 
file,  enter  "Y"  to  first  question,  then  enter  file  name; 
when  offered  a  test,  type  13.  If  data  needs  to  be  entered, 
enter  "S"  for  first  question,  '* V"  for  second;  when  data 
entered,  choose  to  do  a  test;  when  offered  a  choice  of 
tests,  type  13.  Enter  "  0” ,  "V" , . . .  "Z"  for  desired  outpu* 
file. 

b.  Variable  and  Parameter  Choice 

When  prompted  by  Lochiavar: 

(1)  Enter  indices  of  variables  to  be  used  in 
test.  Enter  blocks  or  pairs  to  be  excluded,  if  any.  Enter 
whether  to  use  aggregate  variables. 

(1)  If  aggregate  variables  are  not  used,  enter 


indices  of  variades  to  be  used  in  the  test. 


(2)  If  aggregates  are  used,  Lochinvar  will 
prompt  for  a  choice  on  each  data  sat  variable,  i=1  to  the 
number  of  variables  in  the  data  sat.  If  the  ith  data  set 
variable  is  to  be  excluded  from  the  test,  enter  0.  If  it  is 
to  be  included,  enter  the  index  of  the  aggregate  variable 
into  which  it  should  be  combined.  For  example,  with  the 
Farmland  data  set  described  above,  Lochinvar  would  prompt 
user  choices  on  data  set  variable  i=  1  to  6.  Let  the  test 
be  a  comparison  of  yields  with  or  without  pesticide,  thus 
variables  A  and  C  into  aggregate  variable  1,  B  and  D  into  2, 
and  E  and  F  excluded.  When  prompted  for  the  data  set  var  1, 
enter  1.  For  data  set  var  2,  enter  2.  For  data  set  var  3, 
enter  1.  For  data  set  var  4,  enter  2.  For  data  set  var  5 
and  6,  enter  0. 

4 .  Test  Execution 

Tc  perform  the  test,  after  the  above  entry  of  data 
and  parameters,  type  CRUNCH  while  in  CMS. 

5 .  Output  Available  to  the  User 

*  The  program  CRUNCH  displays  the  output  file  to  which 
the  data  has  been  sent,  e.g.,  "U  LISTING".  The  names  of  the 
data  set  and  variables  used  are  at  the  top  of  the  output 
file.  The  program  will  print  the  test  statistic  and  the 
degrees  of  freedom,  as  well  as  a  two  by  number  of  variables 
contingency  table,  showing  how  many  elements  of  each  vari¬ 
able  are  less  than  or  equal  to  the  grand  median  of  all  vari¬ 
ables,  and  how  many  are  greater  than. 

C.  ACTIONS  PERFORMED  BY  THE  PROGRAM  CRUNCH 

1 •  Description  of  Calculation  of  Test  statistic 

a.  Calculation  of  the  Call  Counts  of  the  M  Matrix 
The  M  matrix  has  two  rows  and  a  column  for  each 
variable  in  the  test.  The  A  row  has  the  number  of  elements 
in  each  variable  less  than  or  equal  to  the  grand  median;  the 


B  row  has  the  number  of  elements  greater  than  rhe  grand 
median.  The  values  of  all  the  variables  are  copied  into  a 
vector,  which  is  sorted;  the  median  of  all  those  values  is 
the  grand  median.  The  counts  of  the  cells  of  the  marrix  M 
are  set  to  0.  Each  element  of  the  first  variable  is  compared 
to  the  grand  median;  if  less  than  or  equal,  the  A  row  first 
column  cell  count  is  incremented,  otherwise  increment  B  row 
first  column  cell  count.  After  all  elements  of  the  first 
variable  are  compared  to  the  grand  mean,  do  the  second  vari¬ 
able,  incrementing  the  2  column's  cell  count,  et  cetera, 
b.  Computing  the  Test  Statistic 

Sum  all  counts  of  the  A  row  to  find  Atot,  and  all 
counts  of  B  to  find  atot.  Sum  Atot  and  Btot  to  find  N.  Sum 
the  first  column  counts  to  find  NAV(1)  ,  the  second  column 
counts  to  find  NAV(2),  et  cetera.  The  expected  cell  count 
for  cell  aj  is  Ato t*  NAV  ( j)  /  N ;  for  bj,  Btot *NA V  ( j)  /N.  The 
test  is  in  essence  the  chi-square  test  for  independence  (r*c 
contingency  table),  with  the  test  statistic  as  the  sum  over 
each  cell  of  observed  minus  expected  count  squared,  divided 
by  the  cell  expected  count.  The  actual  equations  used 
exploit  the  structure  of  the  problem.  The  degrees  of 
freedom  are  (r-1)(c-1)  =  number  of  variables  minus  one. 

2.  program  Method  of  3 Deration 
a.  Aggregation  of  Variable 

The  subroutine  Media  assumes  that  aggregate  vari¬ 
ables  are  always  used.  When  not,  the  program  uses  k  aggre¬ 
gate  variables  where  k  equals  the  number  of  data  set 
variables  included  in  the  test;  aggregate  variable  1  is  data 
set  variable  1 , . .. , aggregat e  variable  k  is  data  set  variable 
k.  When  aggregates  are  in  fact  specified,  their  values, 
contained  in  rp  vector,  will  be  used.  In  either  case,  the 
vector  AGVARI  will  contain  the  indices  relating  each  data 
set  variable  to  an  aggregate  variable. 
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The  Matrix  M  is  a  fiction.  The  vector  NA  is  the 
count  of  elements  less  than  or  equal  to  the  grand  median; 
it  is  arranged  according  to  aggregate  variable  indices.  N3 
is  the  count  of  elements  greater  than  the  grand  median.  In 
a  do  fcr  loop,  i=1  to  the  number  of  data  set  variables,  k  is 
set  to  AGVARI(i) ,  and  each  element  of  the  ith  data  set  vari¬ 
able  is  compared  to  the  grand  median.  If  the  element  is 
lass  than  or  equal  to  the  grand  median,  increment  NA  (k) 
(=NA  (AGVARI  (i)  )  ,  otherwise,  increment  N3(k). 

c.  Compute  the  Test  Statistic 

NA(i)  and  NB(i)  are  summed  to  find  NAV(i).  All 
elements  of  NA  are  summed  to  find  A,  all  elements  of  NB  are 
summed  to  find  B.  A  plus  3  equals  N.  The  test  statistic  is 
computed  using  the  specialized  equations  of  Conover  [ref  1, 
p.  172].  The  degrees  of  freedom  are  the  number  of  aggregate 
variables  minus  one. 

3 .  Program  characteristics 

a.  Limitations 

Maximum  number  of  data  points:  400.  Maximum 
number  of  variables  or  aggregate  variables:  15. 

b.  Subroutines/Procedures  Used 

(1)  Lochinvar:  Median,  Ag_med, Which_vec, None_ex 

(2)  Crunch:  Media 

(3)  External:  Shsort 


XVIII.  KROSKAL  WALLIS  TEST 


A.  OVERVIEW 

1  *  General  Purpose 

With  a  data  set  of  more  than  two  variables, 
XI, X2,. . . ,Xk,  test  whether  the  expected  values  of  all  vari¬ 
ables  are  equal. 

If  this  test's  null  hypothesis  is  rejected,  then  it 
is  possible  to  determine  which  of  the  pairs  of  variables 
have  an  unequal  mean. 

If  the  data  set  is  blocked,  consider  instead  of  this 
test  the  Cochran  (nominal)  ,  Frieiaan  (ordinal)  or  Quade 
(interval)  tests.  If  paired,  consider  the  Sign  (ordinal)  or 
wilcoxcn  (interval)  tests. 

2 .  Level  of  Measure  man  t 

Ordinal 

3 .  Assumptions 

a.  Each  variable  is  a  random  sample  from  within  its 
popu laticn. 

b.  The  sampling  was  mutually  independent  among  the 
populations. 

c.  The  population  distributions  are  identical,  or 
else  some  of  the  populations  tend  to  have  higher  values  than 
others. 

4*  fiZ£othg§fs 

a.  Null  hypothesis 

HO:  E  (XI)  =E  (X  2)  s...E(Xk) 

b.  Alternative  hypothesis 

HI:  E(Xi)#E(Xj)  for  at  least  one  pair  of  popula¬ 

tions  represented  by  the  data  set. 


B.  ACTION  REQUIRED  EY  USER  OF  PROGRAM 


1 .  User  Decisions  about  Program  Options 

The  user  must  specify  which  variables  of  a  data  set 
are  included  in  the  test. 

After  the  test  is  run.  Crunch  will  present  the 
overall  test  statistic,  and  a  series  of  comparisons  between 
the  variables.  If,  and  only  if,  the  null  hypothesis  has 
been  rejected  by  the  overall  test  statistic  may  the  user 
infer  any  information  by  the  comparisons  between  the  pairs 
(Xi,Xj)  of  variables. 

2.  Program  Activation  when  Usiaq  Data  Values 

a.  Initial  Choices 

See  secticn  III.C  for  a  detailed  explanation  of 
the  following. 

In  CHS,  type  LOCHI.  If  data  is  already  in  a  data 
file,  enter  "Y"  to  first  question,  then  enter  file  name; 
when  offered  a  test,  type  14.  if  data  needs  to  be  entered, 
enter  "N"  for  first  question,  "V"  for  second;  when  data 
entered,  choose  to  do  a  test;  when  offered  a  choice  of 
tests,  type  14.  Enter  "  U", "V",. . .  "Z"  for  desired  output 
file. 

b.  Variable  and  Parameter  Choice 

When  prompted  by  Lochiavar,  enter  indices  of 
variables  to  be  used  in  test. 

3 •  I§st  E xecqtion 

To  perform  the  test,  after  the  above  entry  of  data 
and  parameters,  type  CRUNCH  while  in  CHS. 

4.  Output  va liable  to  the  User 

The  program  CRUNCH  displays  the  output  file  to  which 
the  data  has  been  sent,  e.g.,  "U  LISTING".  The  names  of  the 
data  set  and  variables  used  are  at  the  top  of  the  output 
file.  The  overall  test  statistic  will  be  given.  Also 

listed  will  be  comparisons  of  pairs  of  variables;  these 
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comparisons  only  have  meaning  if  the  null  hypothesis  has 
been  rejected.  The  paired  comparison  are  essentially  a  t 
test  run  on  the  ranks  of  the  two  variables  being  compared, 
testing  the  hypothesis  that  their  mean  ranks  are  the  same. 
Crunch  supplies  the  absolute  difference  in  their  mean  ranks 
and  the  pooled  standard  deviation;  the  user  must  find  the 
appropriate  value  of  t  to  conduct  the  t  test. 


C.  ACTIONS  PERFORMED  BT  THE  PROGRAM  CRUNCH 

1.  Calculation  of  Test  Statistic  for  the  Nu 
Hypothesis 

a.  Find  the  Sums  of  Variable  Ranks  Ri 

Rank  all  values  in  the  data  set.  Sum  the  ranks 

in  the  ith  variable  to  get  the  value  Ri.  The  number  of  data 

elements  in  the  ith  variable  is  ni,  so  the  average  rank  in 

the  ith  variable  is  Ri/ni.  The  sum  of  ni  over  all  k  vari¬ 

ables  is  N. 

b.  Find  the  Test  Statistic 

The  test  statistic  numerator  is  the  sum  over  k 
treatments  of  (Ri**2)/ni,  minus  ( (M*  1 )  **2)  *N/4.  To  find  the 
test  statistic  denominator,  S2,  first  sum  over  all  data 
elements  the  square  of  the  ranks.  Subtract  from  this  sum  of 
squares  the  term  (  (N+1)  **2)  *N/4)  .  Divide  that  difference 
by  (N-1)  to  get  S2.  Divide  the  numerator  by  S2  to  find  the 
test  statistic  T. 

c.  Make  Comparisons  of  the  Pairs  of  Variables 

As  mentioned  above,  this  step  is  the  student  t 
test  on  the  ranks  of  two  variables  to  see  if  their  popula¬ 
tion  mean  could  be  equal;  it  means  nothing  if  the  null 
hypothesis  has  not  been  rejected. 

For  each  pair  of  variables  Xi,Xj,  find  the  abso¬ 
lute  difference  in  mean  ranks,  |  Ri/ni-R  j/n  j| .  Find  the 
pooled  variance,  (S2 *  (N- 1-T )  / (N-k) )  times  ( 1/ni*1/nj)  .  The 
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pooled  standard  deviation  is  the  square  root  of  the  pooled 
variance.  If  the  absolute  difference  is  greater  than  the 
pooled  standard  deviation  times  the  desired  value  of  t, 
reject  the  assertion  that  the  mean  of  populations  i  and  j 
are  the  same. 

2.  Program  Met  hod  of  0  per  at  ion 

The  subroutine  Kruwal  calls  Ranque  to  rank  the 
values  of  the  data  set.  In  a  nested  do  for  loop,  i=1  to  k, 
j=1  to  ni,  the  intermediate  terms  of  the  statitics  are 
computed.  Kruwal  prints  the  overall  test  statistic  to  the 
specified  output  file.  It  then  makes  pairwise  listings  of 
the  absolute  difference  of  mean  ranks  and  the  corresponding 
pooled  standard  deviations. 

3 •  Program  Cha racteri sties 

a.  Limitations 

Maximum  number  of  data  points:  400.  Maximum 
number  of  variables/treatments:  15. 

b.  Subroutines/Procedures  Used 

(1)  Lochinvar:  Kruwal 

(2)  Crunch:  Kruwal,  Ranque 

(3)  External:  Shaort 


XIX.  SQUARED  RANK  TEST  FOR  EQUAL  VARIANCE 

A.  OVERVIEW 

1 .  General  Purpose 

With  two  or  more  treatments  in  a  data  set,  tes 
whether  the  population  variances  of  those  treatments  ar 
equal. 

2.  Level  of  Hsa  sure  man  t 
Interval 

3.  Assumptions: 

Randoi  sampling  within  each  treatment  populatior 
and  among  treatment  populations.  Level  of  measurement  : 
interval. 

4.  Hypotheses 

a.  Two  tailed  test,  where  X,X  are  treatments 

(1)  Null  hypothesis  hO:  var  (X)  *var  (X) 

(2)  Alternative  hypothesis  hi:  var  (X)  *var  (Y) 

b.  One  tailed  Aest,  two  treatments 

(1)  Null  hypothesis  hQ:  var  (X)  >=var  (Y) 

(2)  Alternative  hypothesis  hi:  var { X)  <var ( Y) 

c.  One  tailed  test,  two  treatments 

(1)  Null  hypothesis  hQ:  var  (X)  =<var  (Y) 

(2)  Alternative  hypothesis  hi:  var  (X)  >var  (Y) 

d.  Number  of  treatments  (Xi)  are  three  or  more 

(1)  Null  hypothesis  hO:  var  (X 1)  =var  (X2) 

. . .  *var  (Xk) 

(2)  Alternative  hypothesis  hi:  var  (X i)  #var  (X  j) 
for  some  i,  1,  indices  of  the  data  sat 


B.  ACTION  REQUIRED  BY  USER  OF  PROGRAM 


1 •  Use  -  Decisio  ns  about  Program  Options 

The  program  allows  the  user  to  choose  which  treat¬ 
ments  to  include  in  the  test. 

2.  Program  Activation  when  Using  Data  Values 

a.  Initial  Choices 

See  section  III.C  for  a  detailed  explanation  of 
the  following. 

In  CMS,  type  L3C HI.  If  data  is  already  in  a  data 
file,  enter  "Y"  to  first  question,  then  enter  file  name; 
when  offered  a  test,  type  15.  If  data  needs  to  be  entered, 
enter  "N"  for  first  question,  "V"  for  second;  when  data 
entered,  choose  to  do  a  test;  when  offered  a  choice  of 
tests,  type  15.  Enter  "  0” ,  "V" ,. . .  "Z "  for  desired  output 
file. 

b.  Variable  and  Parameter  Choice 

When  prompted  b.-  Lochinvar,  Enter  indices  of 
variables  to  be  used  in  -tes 

4 .  T^est  Execution 

Tc  perform  the  test,  after  the  above  entry  of  data 
and  parameters,  type  CRUNCH  while  in  CMS. 

5.  Output  Available  to  the  User 

The  program  CRUNCH  displays  the  output  file  to  which 
the  data  has  been  sent,  e.g.,  "U  LISTING".  The  names  of  the 
data  set  and  variables  appear  on  the  top  of  the  output  file. 
Also  included  are  the  test  statistic  and  the  number  of 
treatments  used  to  compute  the  statistic. 


C.  ACTIONS  PERFORMED  BY  THE  PROGRAM  CRUNCH 

1.  Description  of  calc  illation  of  Test  Statistic  (T  VT2) 

a.  Find  the  ranks  of  each  absolute  deviation: 

Compute  the  sample  mean  for  each  of  k  treatments. 

Subtract  the  treatment  mean  from  each  of  the  treatment's 
elements.  Take  absolute  values  of  each  of  those  differ¬ 
ences.  Rank  all  N  absolute  differences  (N=total  elements  in 
data  set) . 

b.  Complete  intermediate  statistics: 

(1)  SUMR,  the  sum  of  all  ranks. 

(2)  SUMR2,  the  sum  of  the  square  of  all  ranks. 

(3)  SUMR4,  the  sum  of  all  the  fourth  power  of 

each  rank. 

(4)  SUMR J ,  the  k  element  vector  with  the  sum  of 
the  ranks  for  each  treatment. 

(5)  SUMR2J,  the  k  element  vector  with  the  sum  of 
squared  ranks  for  each  treatment. 

(6)  R2MU,  the  average  squared  rank:  SUMR2/N. 

(7)  R mu ,  the  average  rank:  sumr/n. 

c.  For  two  treatments  .'ompute  the  statistic  T1 

(1)  Numerator:  (SUMR2J  (1)  -n1*R2MU) 

(2)  Denominatior :  SUMR4*n1*n2/(N*(N-1)  )  ,  minus 

(RMU**2)  *n1*n2/(N-1) 

(3)  T  1=nu  merator/denominat  or 

d.  For  three  or  more  treatments  compute  T2 

(1)  Numerator:  difference  between  these  two 

terms,  sum  over  k  treatments  of  (SUMR2J**2)  /ni,  minus 
(R2MU**2)  *N 

(2)  Denominator:  (SUMH4-N*  (R2MU**2)  )  /  (N-1) 

(3)  T2=numerato  r/denominator 

2.  Program  Operations 

The  program  uses  subroutine  MU V A R  to  find  treatment 
.  The  subroutine  Squark  finds  the  absolute  deviations; 
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XX.  fliSIiSX  I1ST  POE  EfiOAJ,  VARIANCE 


A.  OVERVIEW 

i  •  Sami  sa2£2ss 

Test  for  equal  variance  among  the  k  underlying  popu¬ 
lations  of  the  variables  or  treatments,  reflected  by  the 
values  for  those  treatments  in  the  data  set. 

2.  itsisl  2l  Siismaai 

Interval 

3.  &5s§aa£^laaa 

Random  sampling  within  each  treatment,  independence 
of  sampling  among  treatments.  Interval  level  of  measurement 

4 .  Hypothesis 

a.  Null  hypothesis 

HO:  var  (XI)  *var  (X2)  *.  ..»var  (Xk)  ,  where  k=  number 

of  treatments 

b.  Alternative  hypothesis 

HI:  var  (Xi)  #var  ( Xj)  for  at  least  one  pair 

B.  ACTION  REQUIRED  BT  USEB  OF  PROSRAH 

i*  flsat  fi££iai2M  £i2S£a§  2Eti°as 

The  program  allows  the  user  to  choose  which  treat¬ 
aents  to  include  in  the  test. 

2.  aisn  2§iai  2ata  values 

a.  Initial  Choices 

See  section  III.c  for  initial  choice  details. 

In  CHS,  type  LOCHI.  If  data  is  already  in  a  data 
file,  enter  *T"  to  first  question,  then  enter  file  name; 
when  offered  a  test,  rype  16.  If  data  needs  to  be  entered, 
enter  "N"  for  first  question,  "V"  for  second;  when  data 


entered,  choose  to  do  a  test;  vhea  offered  a  choice  of 
tests,  type  16.  Enter  "  a", "V",. . . "Z"  for  desired  output 
file. 

b.  Variable  and  Parameter  Choice 

When  proepted  by  Lochiavar,  Enter  indices  of 
variables  to  be  used  in  test. 

4.  Iasi  SllSS IttiSS 

To  perform  the  test,  after  the  above  entry  of  data 
and  paraseters,  type  CBONCH  while  in  CHS. 

5.  output  Available  t£  the  User 

The  program  cbonch  displays  the  output  file  to  which 
the  data  has  been  sent,  e.g.,  "0  LISTING".  The  names  of  the 
data  set  and  variables  used  are  at  the  top  of  the  output 
file.  The  there  are  the  aaxisum  and  miniaua  variances  and 
indices  of  the  associated  treatments.  Piaally,  there  is  the 
P  statistic,  to  be  compared  to  a  table  prepared  for  the 
Hartley  test. 

C.  ACTIONS  PE  BP  OB  NED  BT  THE  PBOGBAH  CBUNCH 

1.  Description  o£  Calculation  2f  Test  statistic 

Crunch  does  the  following;  Compute  the  sample  vari¬ 
ance  for  each  treatment.  Find  the  smallest  and  largest 
variance.  Divide  the  largest  variance  by  the  smallest  vari¬ 
ance  to  yield  the  test  statistic  P.  Compare  the  test 
statistic  to  a  special  table  to  see  if  the  null  hypothesis 
should  be  rejected. 

2.  era  an  a  flatiaa  21  aatmalaa 

The  treatment  variances  are  figured  in  the  subrou¬ 
tine  HOVAB.  The  subroutine  HABTLE  scans  the  variances  to 
find  the  largest  and  the  smallest,  divides  the  former  by  the 
latter,  and  gives  that  number  as  tha  test  statistic  F. 


3.  maai  sfr&sassqsls&cg 

a.  Limitations 

The  aaziaua  number  of  data  elements  is  400; 
aaziaoa  number  of  treataents  is  15. 

b.  Subroutine/Procedures  Involved 

(1)  Lochinvar:  Hartley,  Whicb_vec 

(2)  Crunch:  Har tle,Buvar 

(3)  External:  Shsort 


the 
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XXI.  PRIEDM AN  JJSI  ££2  22SIIH  IS  ST 


A.  OVERVIEW 

i .  ssaml  £ai£osa 

To  perform  a  treataent  and  block  test,  similar  to 
ANOVA,  using  only  ordinal  data.  In  fact,  the  data  may  be 
rankable  only  within  each  block;  i.e.,  no  relationship 
between  data  values  in  different  blocks  is  necessary. 

The  Friedman  test  is  for  data  sets  of  complete 
block  design,  i.e.,  each  treataent  is  applied  to  each  block. 

The  Durbin  test  is  a  generalized  Friedman  *est,  for 
use  with  incosplete  block  design.  With  t  treatments  and  b 
blocks,  each  tr eatment  is  applied  to  r  blocks  (r<b)  ,  and 
each  block  receives  k  treatments  (k(t) .  The  user  must  guar- 
antee  that  the  design  is  balanced,  and  insert  a  filler  value 
in  place  of  sissing  data  values  whan  entering  the  data  set. 

Each  test  will  generata  a  sat  of  differences 
between  pairs  of  treatment  sunned  ranks.  If,  and  only  if, 
the  null  hypothesis  is  rejected,  these  differences  nay  be 
used,  in  conjunction  with  a  standard  deviation  value  and  the 
student's  t  distribution,  to  determine  if  there  is  a  signif¬ 
icant  different  between  two  treatments. 

2*  mil  21  atiaitiam 

Ordinal  within  each  block. 

3.  Assumptions 

Values  within  blocks  are  indapendent  among  blocks. 

«•  flz&s&kaii! 

a.  Mull  hypothesis 

HOt  Each  possible  ranking  within  the  block  is 
equally  likely  (that  is,  the  treataent  effects  are  equal) 

b.  Alternative  hypothesis 


r  fc-f  *3* 


HI:  At  least  one  of  the  treatments  tends  to  get 

higher  ranks  than  at  least  one  other  treatient. 

B.  ACTICN  REQUIRED  EY  USER  OF  PROGRAM 

1*  2J5I  £mai  2£li2ES 

The  user  must  decide  which  treatments  and  blocks  to 
include  in  the  test.  If  a  incomplete  block  design  is  to  be 

used,  the  user  must  specifically  enter  the  data  set  for  this 

purpose:  there  will  be  t  variables,  and  b  elements  entered 
for  each  variable.  In  those  elements  which  have  a  no  data 
value  (a  treatment  which  was  not  applied  to  some  block)  , 
enter  a  filler  value.  In  a  data  sat,  there  can  ba  only  one 

filler  value,  and  it  must  be  different  than  any  of  th9  data 

values.  For  example,  if  all  the  data  values  are  positive, 
•1  could  be  used  as  a  filler. 

2.  Program  Activation  when  3§iai  Data  Values 

a.  Initial  Choices 

See  section  III.C  for  a  detailed  explanation  of 
the  following. 

In  CHS,  type  LOCHI.  If  data  is  already  in  a  data 
file,  enter  "I"  to  first  guestion,  then  enter  file  name; 
when  offered  a  test,  type  17.  if  data  needs  to  be  entered, 
enter  "Nw  for  first  question,  "V"  for  second;  when  data 
entered,  choose  to  do  a  test;  when  offered  a  choice  of 
tests,  type  17.  Enter  "  U", "V" , . . .  "Z"  for  desired  output 
file. 

b.  Variable  and  Parameter  Choice 

When  prompted  by  Lochinvar: 

(1)  Enter  whether  to  use  the  Friedman  or  Durbin 

test. 

(2)  If  the  Friedman  test  is  chosen,  enter  the 
treatments  to  be  used  and  any  blocks  to  be  excluded. 
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(2)  If  the  Durbin  test  is  chosen,  enter  the 
tre&taents  to  be  used,  any  blocks  to  be  excluded,  k=  number 
of  tre&taents  applied  to  each  block,  r*  number  of  blocks  to 
vhich  each  freataent  is  applied,  and  the  filler  value. 

*  -  isst  B&ftgaUga 

To  perform  the  test,  after  the  above  entry  of  data 
and  paraaeter3,  type  CBUNCH  while  in  CSS. 

5.  Qst£&  iaUafeli  IS  Iks.  Usar 

The  pcograa  CBONCH  displays  the  output  file  to  which 
the  data  has  been  sent,  e.g.,  H0  LISTING".  The  names  of  the 
data  set  and  variables  used  are  at  the  top  of  the  output 
file.  The  overall  test  statistic  for  the  null  hypothesis 
will  be  given.  In  addition,  the  difference  between  pairs  of 
treatments*  summed  ranks  will  be  listed;  these  are  mean¬ 
ingful  only  if  the  null  hypothesis  is  rejected. 

C.  ACTIONS  PEBFOBHED  BY  THE  PBOGRAH  CBONCH 

i .  Bsasrigljaa  si  QslsslzlLsz  si  Xu*  Statistic 

a.  Ose  of  Filler,  If  Durbin  Test 

If  the  Durbin  test  is  indicated,  find  the  maximum 
value  in  the  data  set.  Add  1  to  that  maximum  value  to  form 
HP1.  In  a  do  for  loop,  substitute  HP  1  for  the  filler  value, 
and  create  the  integer  vector  INTEST.  INTEST (i)  equals  1 
if  the  ith  element  in  the  data  set  is  not  the  filler  value, 
0  if  the  ith  eleaent  is  a  filler. 

b.  Bank  Values  Block  by  Block 

In  both  tests,  find  the  ranks  of  the  values  of 
the  data  set  block  by  block.  Load  those  ranks  into  the  R 
vector. 

In  the  Durbin  test,  the  filler  elements  will  have 
a  higher  rank  than  those  of  data  elements,  since  the  fillers 
were  loaded  with  HP  1 .  Therefore,  after  all  the  ranking  is 
complete,  multiply  the  raak  vector  times  INTEST;  this  sets 
the  ranks  of  all  filler  eleaent s  to  3. 


c.  Sms  of  Treataent  Ranks  and  Total  Subs 

Square  each  rank  of  the  rank  vector  R,  and  sua 
those  squares  over  all  treatments  and  blocks  to  fora  A2.  Sum 
the  ranks  of  each  treataent  to  fora  the  vector  RJ;  RJ  (j) 
contains  the  sua  of  ranks  of  the  jth  treataent. 

d.  Test  Statistic  for  the  Priedaan  Test 

Square  each  eleaent  of  the  RJ  matrix;  sua  those 
squares;  divide  that  sua  of  squares  by  the  nuaber  of  blocks 
to  find  B2.  The  dencainator  of  the  test  statistic  is  A2-B2. 
Let  b  be  the  auaber  of  blocks ,  t  the  nuaber  of  treataents. 
The  nuaerator  of  the  test  statistic  is  (b-1)  *  (B2  -0.25bk* 

(k*1)**2).  Divide  the  nuaerator  by  the  denominator  to  get 
the  test  statistic  T2.  Compare  12  to  the  P  distribution 
with  (k-1)  and  < k—  1 )  (b-1)  degrees  of  freedom 

e.  Test  Statistic  for  the  Durbin  Test 

Sua  the  squares  of  RJ.  Multiply  that  sum  of 
squares  tiae  12(t-1).  Divide  that  product  by  rt  (k-1)  (k+1), 
where  r  is  the  nuaber  of  blocks  to  which  each  treataent  is 
applied,  and  k  is  the  nuaber  of  treataents  applied  to  each 
block.  This  quotient  is  the  first  term  of  the  test 
statistic.  The  second  term  is  3r(t-1)  (k+1)  divided  by 
(k-1)  .  Subtract  the  second  term  froa  the  first  tera  to  find 
the  test  statistic  T.  T  is  compared  to  the  chi-square 
distribution  with  t-1  degrees  of  freedom. 

f.  Paired  Coaparisons  of  Treataents 

Crunch  will  print  the  following  numbers  in  all 
cases,  but  they  only  have  meaning  when  the  null  hypothesis 
is  rejected.  For  each  i,j  of  the  treatments,  the  absolute 
difference  between  the  suas  of  treatment  ranks  is  listed. 
The  nuaber  that  is  equivalent  to  the  standard  deviation  is 
also  computed.  For  absolute  differences  greater  than  the 
standard  deviation  tiaes  the  t  quantile  (1-alpha/2),  the 
treataents  can  be  considered  to  have  a  different  effect. 


2.  Program  Method  of  Operation 

The  subroutine  Friedm  performs  most  of  the  computa¬ 
tions.  The  subroutine  Ranque  ranks  the  values  block  by 
block.  The  subroutine  Parcom  does  the  comparisons  of  treat¬ 
ment  sums  of  ranks. 

3.  Progmi  &|£gcteri£ti£s 

a.  Limitations 

Maximum  number  of  data  points:  400.  Maximum 
number  of  variables/trea tme nts:  15. 

b.  Subroutines/Procedures  Used 

(1)  Lochinvar:  Friedman 

(2)  Crunch:  Friedm,  Ranque,  Rnq,  Parcom 

(3)  External:  Shsort 


XXII.  QPADB  TEST 


A.  OVERVIEW 

1 .  Ssasal  £iE2£se 

With  interval  data  organized  into  t  treatments  and  b 
blocks,  test  whether  all  treatments  have  the  same  effect. 
If  not,  treatients  can  be  compared  for  differences  in  pairs. 

2.  Lsial  s£  fla&saitBgai 

Interval 

3.  &52ai£tlQfls 

a.  The  values  of  each  blocks  are  independent  of  the 
values  of  the  other  blocks. 

b.  The  sampling  within  each  block  can  yield  a  range 
of  the  elements;  that  range  must  be  rankable  among  blocks 

a .  aieg&aaig 

a.  Null  hypothesis 

HO:  Each  possible  ranking  within  the  block  is 
equally  likely  (that  is,  the  treatment  effects  are  equal) 

b.  Alternative  hypothesis 

HI:  At  least  one  cf  the  treatments  tends  to  get 

higher  ranks  than  at  least  one  other  treatment. 

B.  ACTIOH  REQUIRED  BT  USER  OF  PR03RAH 

1*  flaai  2S£i3i22£  &S211  SI22ES1  2£tioss 

The  user  must  specify  the  treatments  to  be  included 
in  the  test,  and  whether  any  blocks  are  to  be  excluded. 
Crunch  will  provide  the  absolute  difference  between  pairs  of 
treatments;  if  and  only  if  the  null  hypothesis  is  rejected, 
these  comparisons  can  be  used  to  test  whether  the  effects  of 
the  pairs  are  equal. 


2.  Program  Activation  when  Using  Dana  Values 

a.  Initial  Choices 

See  section  III.c  for  a  detailed  explanation  of 
the  following. 

In  CHS,  type  LOCHI.  If  data  is  already  in  a  data 
file,  enter  "I"  to  first  question,  then  enter  file  name; 
when  offered  a  test,  type  18.  If  data  needs  to  be  entered, 
enter  "N"  for  first  question,  "7"  for  second;  enter  the 
data;  when  offered  a  choice  of  tests,  type  18.  Enter 
"O","?",. .. MZ"  for  desired  output  fils. 

b.  Variable  and  Parameter  Choice 

Hhen  prompted  by  Lochia var:  Enter  indices  of 

variables  to  be  used  in  test. 

3.  1231 

To  perform  the  test,  after  the  above  entry  of  data 
and  parameters,  type  CRUNCH  while  in  CHS. 

4.  output  Available  to  the  User 

The  program  CRUNCH  displays  the  output  file  to  which 
the  data  has  been  sent,  e.g.,  "U  LISPING".  The  names  of  the 
data  set  and  variables  used  are  at  the  top  of  the  output 
file.  The  tast  statistic  T  and  its  degrees  of  freedom  are 
listed.  Pinally,  the  absolute  difference  between  the  pairs 
of  sums  of  treatment  ranks  are  given.  These  differences, 
when  used  with  a  standard  deviation  computed  by  Crunch  and 
the  t  distribution,  can  be  used  to  check  for  equal  effect 
between  pairs  of  treatments. 

C.  ACTIONS  PERFORMED  BY  THE  PROGRAM  CRUNCH 

1.  Description  fil  Calculation  last  Statistic 

a.  Rankings  within  and  among  Blocks 

Bank  each  element  in  the  block;  assign  the 
elements  to  the  matrix  of  ranks  R.  Subtract  the  largest 
element  from  the  smallest  element;  that  becomes  the  block's 


range,  stored  in  the  vector  R .  After  all  blocks  have  had 
their  elements  ranked,  rank  the  vector  R;  those  ranks  form 
the  vector  Q. 

b.  The  Size  Matrix  S 

With  t  equal  to  the  number  of  treatments,  the 
average  rank  of  R  is  (t+1)/2.  Subtract  that  average  rank 
from  each  element  of  R.  Multiply  the  reduced  ranks  of  the 
first  block  by  Q(1),  the  reduced  ranks  of  the  second  block 
Q(2),  et  cetera,  tc  fora  the  matrix  s. 

c.  Sums  of  the  Treatment  Elements  of  S 

Sum  the  elements  of  the  jth  treatment  of  S  over 
all  blocks  to  form  the  jth  element  of  the  vector  SJ.  Square 
each  element  of  SJ,  sum  those  squares  over  all  treatment, 
and  divide  by  b,  the  number  of  blocks,  to  find  51.  Square 
all  elements  of  S,  and  sum  all  those  squares  to  find  A1 . 

d.  Test  Statistic 

The  tes-t  statistic  T1  is  (b-1)  time  B1  divided  by 
the  quantity  A1  minus  B1 .  Compare  T1  to  an  F  statistic  with 
degrees  of  fraedoa  (t-1)  and  (b-1)  (t-1). 

e.  Pairwise  Comparisons 

These  values  have  meaning  only  if  the  null 
hypothesis  is  rejected.  Compute  the  absolute  difference 
between  each  pair  of  the  vector  SJ.  If  that  difference  is 
greater  than  the  student's  t  quantile  times  the  computed 
standard  deviation,  the  pair  of  treatments  can  be  said  to 
have  a  different  effect.  The  standard  deviation  is  found  by 
taking  the  square  root  of  (2b  (A1-B1)  /  (b-1)  (t-1) )  . 

2.  Program  Method  of  0 Deration 

The  subroutine  Quade  calls  Ranque  to  find  the  matrix 
R  and  the  vector  Q.  The  test  statistic  is  computed  in 
Quade,  as  is  the  standard  deviation.  The  subroutine  Parcom 
makes  the  pairwise  comparison. 


3  •  Proqra a  cfra  racteristics 


a.  Limitations 


Maximum  number  of  data  points:  400. 
number  of  variables/treatments:  15. 

b.  Subroutines/Procedures  Used 

(1)  Lochinvar:  Quade 

(2)  Crunch:  Quade,  Ranqua,  Rnq,  Parcom 

(3)  External:  Shsort 


Maxi  mum 


XXIII.  COCHR  AN  TEST  FOR  RELATED  ;J3S ER 7 ATIONS 

A.  OVERVIEW 

1 .  QS^SUl  ?ai£2se 

With  a  data  set  of  b  blocks,  to  which  k  treatments 
have  been  applied,  test  whether  the  treatments  have  been 
equally  effective.  "Effective"  must  be  reducible  to  two 
categories,  mutually  exclusive  and  collectively  exhuastive. 
Here,  these  categories  will  called  success  and  failure. 

If  the  data  is  not  reducible  in  this  fashion,  the 
Priedman  (ordinal)  or  Quads  (interval)  tests  are  available. 

2.  Level  of  Measurement 
Nominal 

3 .  Assumptions 

The  blocks  are  a  random  sample  from  among  all  blocks 

* .  aiEgtfresjs 

a.  Null  hypothesis 

HO:  The  treatments  are  equally  effective 

b.  Alternative  hypothesis 

HI:  There  is  a  difference  in  effectiveness  among 

the  groups. 

B.  ACTION  REQUIRED  ET  USER  OF  PROGRAM 

1.  User  Decisions  lb2ai  Procrai  Options 

The  user  must  decide  the  standard  for  success  or 
failure  of  each  treatment.  If  the  data  is  nominal,  the  user 
should  enter  cell  counts.  see  paragraph  2,  below.  if  the 
data  is  ordinal,  the  user  may  enter  a  partition  value  for 
each  treatment  of  a  data  3et;  see  paragraph  3.  If  this 
latter  method  is  used.  Crunch  will  classify  each  value  of  an 


entered  data  set  as  a  success  ot  failure  by  its  relationship 
to  the  partition  value  for  its  treatment. 

If  partition  values  are  usad  in  conjunction  with  a 
data  set,  the  user  oay  specify  which  variables  of  the  data 
set  are  to  be  included  in  the  test,  and  may  designate  blocks 
to  be  excluded. 

2.  Program  Activation  when  Osing  Cell  Counts 

a.  Initiation  and  test  selection 

In  CMS,  type  LOCHI.  Enter  WN"  for  no  to  question 
about  previously  entered  data.  Enter  "C"  for  cell  counts. 
Enter  . . ."Z"  for  desired  output  file,  when  offered  a 

choice  of  tests,  enter  19. 

b.  cell  count  entry 

Lochinvar  will  ask  how  many  treatments  and  how 
■any  blocks  are  included  in  the  test.  after  those  numbers 
are  entered,  Lochinvar  will  ask  whether  block  1,  treatment  1 
was  a  success.  Enter  yes  or  no.  The  program  will  prompt 
for  each  treatment  in  block  1,  then  move  to  block  2,  and  so 
forth,  through  the  b*k  choices.  The  matrix  of  1*s 
(successes)  and  0*s  (failures)  will  be  displayed,  and  the 
user  has  the  chance  to  correct  the  values  for  any  block  (the 
program  will  prompt  for  entries  of  each  treatment  of  a  spec¬ 
ified  block) . 

3.  Prog^m  Activation  when  Osing  Data  Values 
a.  Initial  Choices 

See  section  III.C  for  a  detailed  explanation  of 
the  following. 

In  CMS,  type  LOCHI.  If  data  is  already  in  a  data 
file,  enter  MT”  to  first  question,  then  enter  file  name; 
when  offered  a  test,  type  19.  If  data  needs  to  be  entered, 
enter  "N"  for  first  question,  "V"  for  second;  when  data 
entered,  choose  to  do  a  test;  when  offered  a  choice  of 
tests,  type  19.  Enter  '» 0" , "V",. . .  "Z"  for  desired  output 
file. 
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b.  Variable  and  Parameter  Choice 

Lochinvar  will  ask  if  treatment  1  is  in  the 
test.  If  yes,  then  it  will  then  ask  the  partition  value  for 
treatment  1,  whether  successes  lie  above  or  below  that 
value.  The  same  questions  will  be  asked  for  each  treatment. 
After  each  treatment  is  either  excluded  or  given  a  partition 
value,  Lochinvar  will  offer  the  user  the  chance  to  exclude 
blocks  frca  the  test. 

*  •  1221 

To  perform  the  test,  after  the  above  entry  of  data 
and  parameters,  type  CRUNCH  while  in  CHS. 

5.  salE j&  AzallaSis  12  ii 2 

The  program  CRUNCH  displays  the  output  file  to  which 
the  data  has  been  sent,  e.g.,  "U  LISTING".  The  names  of  the 
data  set  and  variables  used  are  at  the  top  of  the  output 
file.  The  test  statistic  T  will  be  listed. 

C.  ACTIONS  PEAFORHED  BY  THE  PROGRAH  CRUNCH 

1 .  EsasaavjQfl  2I  £§isgia tioa  2 1  12§1  nziiziis. 

a.  Calculation  of  Block  and  Treatment  Totals 

If  the  user  chooses  the  method  of  cell  counts 
(paragraph  2,  above)  ,  Lochinvar  will  sum  the  successes  over 
each  of  b  blocks  and  over  each  of  k  treatments.  These 
totals  will  be  in  the  cells  vector  of  line  3a  of  the  options 
file  (see  Appendix  0  for  details  of  the  options  file),  to  be 
read  by  Crunch. 

If  the  user  chooses  to  enter  partition  values 
(paragraph  3  above)  ,  Crunch  uses  that  information  to  compute 
the  block  and  treatment  totals. 

b.  computing  the  Test  Statistic 

The  numerator  of  the  test  statistic  involves  the 
sums  of  successes  for  each  treatment.  The  total  number  of 
successes  N  is  found  by  summing  each  treatment  number  of 


successes  Ci;  N  divided  by  the  number  of  treatments  k  is  the 
average  number  of  successes  per  treatment.  Find  the  sum  of 
squared  deviations  by  summing  over  k  treatments  the  square 
of  (Ci-N/k)  .  Multiply  that  sum  of  squares  by  k(k-1)  to  get 
the  test  statistic  numerator. 

The  denominator  of  the  test  statistic  is  the  sum 
over  all  b  blocks  of  the  number  of  successes  in  block  i 
times  the  number  of  failures  in  block  i. 

Divide  the  numerator  by  the  denominator. 

2.  Program  .flaJikad  o£  0 peration 

a.  When  Using  Cell  Counts 

In  the  procedure  Cochran,  Lochinvar  prompts  the 
answer  of  yes  or  no  to  the  question  of  whether  block  i, 
treatment  j,  is  a  success.  If  yes,  it  increments  the  total 
of  successes  in  the  count  for  block  i  and  treatment  j.  The 
total  for  block  i  is  contained  in  the  ith  element  of  the 
vector  CELLS;  the  total  for  the  jth  treatment  is  contained 
in  the  (j+b)th  element  of  CELLS,  where  b  is  the  number  of 
blocks.  The  number  of  blocks  (NROW)  and  number  of  treat¬ 
ments  (MCOL)  are  written  in  Line  3  of  the  options  file;  the 
vector  CELLS  is  written  in  Line  3a.  The  Crunch  subroutine 
Celler  reads  those  values,  and  sends  them  to  the  subroutine 
Cochrn,  which  computes  the  test  statistic. 

b.  When  using  Partition  Values 

In  the  procedure  Cochran,  Lochinvar  asks  the 
partition  value  of  each  treatment  included  in  the  test; 
those  values  are  entered  in  the  odd  numbered  elements  of 
vector  PART_V,  that  is,  1,3,. . . ,  (2k-1)  ,  where  k  is  the 
number  of  treatments.  In  the  even  numbered  elements  of 
PAHT_V  is  a  +1.0  if  successes  are  below  the  preceding  parti¬ 
tion  value,  or  -1.0  if  the  successes  are  above  it.  For 
example,  FART.V  of  3.0  -1.-0  7.0  1.0  means  that  there  are  two 
treatments;  in  the  first  treatment,  successes  are  above  3.0, 
in  the  second,  successes  are  below  7.0.  The  number  of 


blocks  (NROW)  and  the  nuaber  of  treataents  (N COL)  are 
written  in  the  Line  3  of  the  options  file;  PART_V  is  written 
into  line  3a. 

The  Crunch  subroutine  Cellar  calls  the  subroutine 
Coch  to  use  P&RTJT  and  the  data  elaaents  values  to  find  the 
totals  of  successes  for  the  blocks  and  treataents.  It 
places  those  counts  (in  the  sane  foraat  as  CELLS,  describe 
in  subparagraph  a,  above)  into  the  vector  RCCT.  That  vector 
is  used  by  the  subroutine  Cochrn  to  coapute  the  test 
statistic. 

3 .  Program  Characteristics 

a.  Limitations 

Shan  the  user  enters  cell  counts,  the  sun  of  the 
treataents  and  blocks  cannot  exceed  50  (b+k)  ,  and  the  total 
nuaber  of  cells  (b*k)  cannot  exceed  400. 

whan  the  prograa  coaputas  the  treataent  and  block 
totals,  the  aaxiaua  nuaber  of  data  points  is  400;  the 
aaxiaaa  nuaber  of  treataents  is  15.  The  sua  of  treatments 
and  blocks  (b+k)  cannot  exceed  50. 

b.  Subroutines/Procedures  Used 

(1)  Lochinvar:  Cochran,  Cel_test 

(2)  Crunch:  Cochrn, Cellar 

(3)  External:  Shsort 
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XXIV.  KOLMOGOROV  £2£2 


I.  OVER  7X2  H 
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1.  General  la 12255 

With  a  saaple  of  30  or  less  data  points,  test  to  see 
whether  the  population  from  which  the  saaple  is  drawn  is  of 
a  specified  distribution  with  specified  parameters.  The 
choice  of  hypothesised  distributions  is  uniform,  normal, 
exponential,  weibull,  and  erlang  (gamma  with  an  integer 
shape  paraaetar)  . 

The  Lilliefors  test  is  identical  to  the  Kolmogorov 
test  except  that  it  estiaates  parameters  froa  the  saaple; 
normal  and  exponential  distributions  can  be  hypothesised. 

2.  iszsi  ai  asaiaisasai 

Interval 

3.  AgaugeSlSBi 

The  saaple  is  a  random  sample  from  the  population. 

4.  glpgthggeg 

With  P*  (x)  as  the  theoretical  distribution  with 
specified  parameters,  and  S(x)  as  the  population*s  under¬ 
lying  distribution  function,  the  hypotheses  are; 

a.  Two  tailed  test 

(1)  Bull  hypothesis  hO:  F*(x)»S(x)  for  all  x 

(2)  llternative  hypothesis  hi;  P*(x)#S(x)  for 
at  least  one  x  in  range  of  saaple 

b.  One  tailed  test 

(1)  Hull  hypothesis  h3;  F*(x)£S(x)  for  all  x 

(2)  llternative  hypothesis  hi;  P*(x)>s(x) 

c.  One  tailed  test 

(1)  Hull  hypothesis  h3;  F*(x)>S(x) 

(2)  Alternative  hypothesis  hi;  F*(x)<S(x) 
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B.  ACTION  REQUIRED  BY  USER  OF  P ROSEAU 


1 .  User  Decisjops  a boat  Program  Potions 

If  a  data  set  has  sore  than  one  variable,  the  user 
must  specify  which  is  used.  If  the  chosen  variable  has  more 
than  30  elements,  the  chi-square  goodness  of  fit  test  will 
automatically  be  given.  Otherwise,  the  user  must  specify 
the  distribution  to  use,  and  all  its  parameters,  from  among 
the  following: 

a.  Normal  (specify  mu,  sigma  squared) 

b.  Exponential  (specify  lambda*rate, scale) 

c.  Uniform  (specify  a*lower  bound, b*upper  bound) 

d.  Erlang  (specify  n*integer, shape;  lambda=scale) 

e.  aeibull  (specify  alpha*shape,  lambda=scale) 

2.  gsaaai  is&Aiailaa.  zteR 

a.  Initial  Choices 

See  section  III.C  for  a  detailed  explanation  of 
the  following. 

In  CHS,  type  LOCHI.  If  data  is  already  in  a  data 
file,  enter  "Y"  to  first  question,  then  enter  file  name; 
when  offered  a  test,  type  20.  If  data  needs  to  be  entered, 
enter  "N"  for  first  question,  "V"  for  second;  when  data 
entered,  choose  to  do  a  test;  when  offered  a  choice  of 
tests,  type  20.  Enter  "  U",,,V,,r. . .  "Z"  for  desired  output 
file. 

b.  Variable  and  Parameter  Choice 

When  prompted  by  Lochinvar: 

(1)  Enter  index  of  variable  to  be  used  in  test. 

(2)  Enter  number  of  theoretical  distribution. 

(3)  Enter  parameters  as  prompted. 

(4)  Eater  letter  of  type  hypothesis. 


3.  Ssgt  SS&caU2£ 

Tc  perform  the  test,  after  the  above  entry  of  data 
and  parameters,  type  CRUNCH  while  in  CHS. 

4.  Output  Available  to  tfce  2§1£ 

The  program  CRUNCH  displays  the  output  file  to  which 
the  data  has  been  sent,  e.g.,  "U  LISTING".  The  names  of  the 
data  sec  and  variables  used  are  at  the  top  of  the  output 
file.  The  values  of  the  largest  difference  F*(x)-S(x),  the 
smallest  (most  negative)  difference,  and  the  greatest  abso¬ 
lute  difference  are  displayed.  Depending  on  the  hypothesis. 
Crunch  selects  the  proper  value  of  thase  three  as  the  test 
statistic,  and  displays  that  value. 


C.  ACTIONS  PERFORMED  BY  THE  PROGRAM  CRUNCH 

1.  Description  Calc ulation  2£  lest  Statistic 

a.  Empirical  Cumulative  Distribution  Function  (CDF) 
Sort  the  data  elements,  giving  order  statistics 

X(  1)  ,  • . .  ,  X  (n) ,  where  n  is  the  number  of  data  elements.  The 
empirical  cdf  equals  0  for  values  of  X  lower  than  the 
smallest  X,  1(1).  At  X(1)  ,  the  cdf  jumps  to  1/n;  the  cdf 
increases  by  1/n  at  each  new  order  statistic.  At  X  (n)  ,  the 
empirical  cdf  equals  1,  and  remains  1  for  all  values  of  x 
greater  than  the  sample.  The  empirical  cdf  is  stored  in  the 
vector  S. 

b.  points  on  the  Theoretical  CD F 

With  a  specified  distribution  and  parameters, 
find  the  value  of  the  cdf  for  each  value  of  the  order 
statistics  X(1)  ,...  ,X(n)  .  For  example,  with  exponential 
(lambda*2)  specified  and  X*1.5,  the  value  of  the  theoretical 
cdf  is  1.0-exp  (- (2*1 .5) )  »0.9502.  Store  these  values  in  the 
vector  F. 
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CDF*  s 
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Difference  between  Empirical  and  Theoretical 


With  a  sample  of  size  n,  there  are  2n  differences 
between  S  and  F.  For  example,  at  X(1),  s  jumps  from  0  to 
1/n,  so  that  F(1)  is  compared  to  both  values.  At  each  order 
statistic  value,  the  theoretical  distribution  F  (i)  is 
compared  to  S(i-1)  and  S  (i)  .  Crunch  keeps  track  of  the  tmin 
(most  negative  difference,  or  if  all  are  positive,  the 
smallest  positive)  ,  and  taax  (most  positive,  or  if  all  are 
negative,  the  closest  to  zero), 
d.  Test  Statistics 

For  a  two  tailed  t9st  (hypothesis  A)  ,  the  test 
statistic  is  the  larger  of  the  absolute  values  of  tmax  and 
tain.  Fcr  hypothesis  B,  taax  is  the  test  statistic.  For 
hypothesis  C,  tain  is  the  test  statistic. 

2.  Program  Method  o£  Operation 

The  subroutine  Kolmog  compute  the  empirical  cdf,  by 
a  do  for  loop  i*1  to  n,  with  S(i)*i/n.  Shsort  orders  the 
sample  values  to  fora  order  statistics.  The  theoretical 
cdf's  are  found  in  one  of  the  following  subroutines:  normf, 
expf,  unif,  erlf,  and  veibf. 

The  subroutine  Findt  finds  the  differences  between  S 
and  F  for  each  value  of  X.  For  i*1,  F  is  compared  to  0  and 
S (1 )  •  Then,  in  a  dc  for  loop  i*2  to  a,  F  (i)  is  compared  to 
S(i)  and  S(i-1).  These  2n  difference  are  stored  in  the 
vector  T.  Another  do  for  loop,  j*1  to  2n,  finds  tmax  and 
tain.  From  those  two  values,  the  greatest  absolute  differ¬ 
ence  is  found. 

3 •  Program  Character! sties 
a.  Limitations 

Maximum  number  of  data  points:  400. 
number  of  variables/treatments:  15. 


Maximum 


A.  OVERVIEW 


1 .  Sgfigial  2SIB25S 

With  a  saaple  of  lass  than  31  elements,  this  tasts 
whether  the  saaple  came  from  a  specified  distribution, 
either  a  noraal  or  exponential.  The  parameters  used  for  the 
test  are  the  aaxiaua  likelihood  estimates  (MLE)  ,  derived 
froa  the  saaple  aean  and  variance.  Those  two  values  are 
computed  and  displayed  in  the  output  file;  however, 
Lilliefors  tests  only  the  distribution  type,  not  the  param¬ 
eter  value.  (Any  value  other  than  the  HLE  would  have  a 
higher  probability  of  rejecting  the  null  hypothesis). 

If  specific  parameters  aust  be  tested  for,  use  the 
Kolmogorov  test. 

•  2.  levs!  as asaisssai 

Interval 

3 .  assaaeUaai 

The  data  values  are  a  random  sample  of  the  popula¬ 
tion. 

BYB2lh3SlS 

WithP*(x)  as  the  hypothesised  distribution  func¬ 
tion,  with  HLE  paraaeters,  and  S(x)  as  the  actual  population 
distribution 

a.  Hull  hypothesis 

HO:  P*(x)*S(x)  for  all  x 

b.  Alternative  hypothesis 

HI:  P*(x)  «(x)  for  at  least  one  x 


B.  ACTION  REQUIRED  EY  USER  OF  PROSEAM 

1 .  User  Dec is ions  about  Program  Options 

If  more  than  one  variable  is  in  the  data  set,  the 
user  must  specify  which  one  to  utilize  for  the  test.  If 
that  variable  has  more  than  31  elements,  the  goodness  of  fit 
test  will  be  offered  automatically.  Otherwise,  choose 
whether  to  test  for  the  normal  or  the  exponential  distribu¬ 
tion. 

2.  Program  Activation  when  Using  Data  Values 

a.  Initial  Choices 

See  section  III.C  for  a  detailed  explanation  of 
the  following. 

In  CMS,  type  LOCHI.  If  data  is  already  in  a  data 
file,  enter  "Y"  to  first  guestion,  then  enter  file  name; 
when  offered  a  test,  type  21.  If  data  needs  to  be  entered, 
enter  "N"  for  first  question,  "V"  for  second;  when  data 
entered,  choose  to  do  a  test;  when  offered  a  choice  of 
tests,  type  21.  Enter  H 0”, WV",. . . "Z"  for  desired  output 
file. 

b.  Variable  and  Parameter  Choice 

When  prompted  by  Lochinvar: 

(1)  Enter  index  of  variable  to  be  used  in  test, 
if  there  is  more  than  one  in  the  data  set. 

(2)  Enter  whether  to  test  against  the  best 
fitting  normal  or  exponential  distribution. 

3.  ISSt  BX££U£i£n 

Tc  perform  the  test,  after  the  above  entry  of  data 
and  parameters,  type  CRUNCH  while  in  CMS. 

4.  Output  Available  to  the  User 

The  program  CRUNCH  displays  the  output  file  to  which 
the  data  has  been  sent,  e.g.,  "U  LISTING".  The  names  of  the 
data  set  and  variables  used  are  at  the  top  of  the  output 
file.  The  sample  mean  and  variance  are  listed,  as  well  as 
the  test  statistic. 
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C.  ACTIONS  PERFORMS  D  BY  THE  PROGRAM  CRUNCH 


1 .  Descri pt ion  of  Calc ulation  of  Test  Statistic 

a.  Finding  the  MLS  Parameters 

The  sample  mean  and  variance  are  computed.  If 
the  test  is  for  normality,  the  theoretical  distribution' s 
parameters  mu  and  sigma  squared  are  estimated  by  the  sample 
mean  and  variance  respectively.  If  the  exponential  is  the 
theoretical  distribution,  its  parameter  lambda  is  the  recip¬ 
rocal  of  the  sample  mean. 

b.  Calculation  of  the  Test  Statistic 

The  remainder  of  the  test  is  exactly  the  same  as 
the  two-  tailed  Kolmogorov  test.  The  empirical  distribution 
function  is  generated  using  (1/n)  increments  for  each  of  the 
n  sample  data  elements.  The  difference  between  the  empir¬ 
ical  and  theoretical  distribution  is  computed  at  each  value 
of  x  contained  in  the  sample.  The  greatest  absolute  differ¬ 
ence  is  the  tast  statistic. 

2.  Program  Uat^od  of  Operation 

The  subroutine  Lilief  calls  Muvar  to  find  the  sample 
mean  and  variance,  then  assigns  MLB  values  to  the  rp  vector. 
The  subroutines  Norf  cr  Expf,  as  appropriate,  are  called  to 
generate  values  of  the  theoretical  distribution  at  the 
values  of  X  (1 )  , .. .  X  (n) .  The  subroutine  Findt  finds  the 
largest,  smallest  and  greatest  absolute  difference;  that 
last  value  is  the  test  statistic. 

3.  Program  Characteristics 

a.  Limitations 

Maximum  number  of  data  points;  30,  in  one  vari¬ 
able. 

b.  Subroutines/Procedures  Used 

(1)  Lochinvar:  Lillie 

(2)  Crunch;  Lilief,  Muvar, Expf, Norf , Findt 

(3)  External;  Shsort 


XXVI.  SHAPIR3  Wljyc  TEST  P3  R  NORMALITY 

A.  OVERVIEW 

1  •  General  Purpose 

With  a  sample  of  30  or  less  data  points,  test  to  see 
if  the  sample  is  from  a  normal  distribution  of  unspecified 
parameters.  Although  Crunch  computes  the  sample  mean  and 
variance,  they  are  not  displayed.  No  statement  about  the 
population  parameters  is  made  by  this  test. 

2.  Level  of  Mea  sure  men t 
Interval 

3 .  Assumptions 

The  sample  is  randomly  drawn  from  the  population. 

4 .  Hypothesis 

With  P*(x)  as  a  normal  distribution  of  unspecified 
parameters,  and  S(x)  as  the  distribution  function  of  the 
population, 

a.  Null  hypothesis 

HO:  P*(x)=S(x)  for  all  x 

b.  Alternative  hypothesis 

HI:  P*(x)  #S(x)  for  at  least  one  x 

B.  ACTION  REQUIRED  BY  USER  OF  PROGRAM 

1  •  User  Decisions  aboijt  Program  Ofitions 

If  there  is  more  than  one  variable  in  the  test,  the 
user  must  specify  which  one  is  to  be  utilized.  If  that 
variable  has  more  than  30  elements,  the  goodness  of  fit  test 
will  be  automatically  offered.  If  not,  no  further  choices 
are  necessary. 


2.  Program  Activation  when  Using  Data  Values 

a.  Initial  Choices 

See  section  III.C  for  a  derailed  explanation  of 
the  following. 

In  CMS,  type  LOCHI.  If  data  is  already  in  a  data 
file,  enter  ”Y”  to  first  question,  then  enter  file  name; 
when  offered  a  test,  type  22.  If  data  needs  to  be  entered, 
enter  "H"  for  first  guestion,  WV"  for  second;  when  data 
entered,  choose  to  do  a  test;  when  offered  a  choice  of 
tests,  type  22.  Enter  "  0”,  "V”, . . .  ”Z”  for  desired  output 
file. 

b.  Variable  and  Parameter  Choice 
When  prompted  by  Lochinvar: 

Enter  index  of  variable  to  be  used  in  test,  if 
there  is  more  than  one  in  the  data  set. 

4 .  Test  Execution 

Tc  perform  the  test,  after  the  above  entry  of  data 
and  parameters,  type  CRUNCH  while  in  CHS. 

5.  Output  Available  to  the  Us3£ 

The  program  CRUNCH  displays  the  output  file  to  which 
the  data  has  been  sent,  e.g.,  ”U  LISTING”.  The  names  of  the 
data  set  and  variables  used  are  at  the  top  of  the  output 
file.  The  test  statistic  will  also  be  included. 

C.  ACTIONS  PBRFORHED  BY  THE  PROGRAH  CRUNCH 

1.  Description  of  Calculation  of  Test  Statistic/Level 

a.  Compute  the  Sum  of  Squared  Deviations 

The  sum  of  squared  deviations  is  the  sum  over  n 
of  the  square  of  the  difference  between  X(i)  and  the  mean  of 
X.  The  program  actually  figures  the  sample  variance  and 
multiplies  by  (n-1)  .  This  is  the  denominator  of  the  test 
statistic. 


b.  Coopute  the  Sum  of  the  Weighted  Differences 

The  sample  is  sorted  into  the  order  statistics, 
X(  1)  , . .  • ,  X(n).  Subtract  values  with  the  same  depth  from 

the  end  points, e.g,  subtract  X(n)  -  X(1),  X(n-1) 

X (2)  , . . . , X  (n-i+ 1 )  -  X(i).  The  n  is  even,  there  are  n/2 

differences;  if  n  is  odd,  there  are  (n-1)/2  differences, 
with  the  median  excluded.  Multiply  these  differences  by 
a(i)  weights,  provided  by  Lochiavar.  Sum  the  weighted 
differences.  This  is  the  numerator  of  the  test  staistic. 

c.  Compute  the  Test  Statistic 

Divide  the  sum  of  weighted  difference  by  the  sum 
of  sguared  deviations. 

2.  Program  Method  o£  0 peration 

The  subroutine  finds  the  sample  variance  by  calling 
Nuvar.  Shsort  is  used  to  order  the  sample  into  order 
statistics  X(1)  ,.. . ,  X(n)  .  A  do  for  loop,  i=1  to  trunc(n/2), 
is  used  to  calculate  the  sum  of  a  (i)  *  (X  (n-i  +  1) -x  (i) )  ;  the 
a(i)'s  are  contained  in  the  rp  vector. 

3 -  EI23B.&  C^l£§cteristi£s 

a.  Limitations 

Maximum  number  of  data  points:  400.  Maximum 
number  of  variables/treatma nts:  15. 

b.  Subroutines/Procedures  Used 

(1)  Lochinvar:  Wilkes 

(2)  Crunch:  wilksh,Muvar 

(3)  External:  Shsort 


XXVII.  SMIRNOV  TES2 


A.  OVERVIEW 

1 .  ssasial  Pai22se 

With  two  samples,  X  and  X,  each  drawn  from  a 
different  population,  test  whether  the  underlying  population 
distributions  P(x)  and  G(y)  could  be  equal  over  the  entire 
range  of  X  and  T. 

The  Smirnov  and  Craner-von  Mises  test  differ  only  in 
the  computation  of  the  test  statistic,  and  therefore,  the 
choice  of  hypotheses. 

2 .  Level  2 £  Measurement 
ordinal 

3.  Agsumgtlofls 

The  samples  are  drawn  randomly  from  their  popula¬ 
tions,  and  are  rankabls  against  each  other  as  well  as  within 
samples. 

*»•  BYBQttlSSeS 

a.  Two  tailed  test 

(1)  Null  hypothesis  hD:  F(x)=g(x)  for  all  x 

(2)  Alternative  hypothesis  hi:  F(x)*g(x)  for  at 
least  one  x  in  range  of  sample 

b.  One  tailed  test 

(1)  Null  hypothesis  h3:  F(x)<g(x)  for  all  x 

(2)  Alternative  hypothesis  hi:  F(x)>g(x) 

c.  One  tailed  test 

(1)  Null  hypothesis  hD:  P(x)>g(x) 

(2)  Alternative  hypothesis  hi:  P(x)<g(x) 


B.  ACTICN  REQUIRED  EY  USES  OF  PROGRAM 

1.  User  Decisions  a  boat  grogbU-  O££ion§ 

The  user  last  designate  which  of  two  variables  are 
used,  if  the  data  set  contains  sore  than  two  variables.  The 
user  specifies  the  type  of  hypothesis. 

2.  Program  Activation  when  l&lASLS 

a.  Initial  Choices 

See  section  III.C  for  a  detailed  explanation  of 
the  following. 

In  CHS,  type  LOCHI.  If  data  is  already  in  a  data 
file,  enter  MY"  to  first  question,  then  enter  file  name; 
when  offered  a  test,  type  23.  If  data  needs  to  be  entered, 
enter  "B"  for  first  question,  "V”  for  second;  when  data 
entered,  choose  to  do  a  test;  when  offered  a  choice  of 
tests,  type  23.  Enter  "  U«,"V”,. . .  "Z"  for  desired  output 
file. 

b.  Variable  and  Parameter  Choice 

when  prompted  by  Lochinvar:  Enter  indices  of 

variables  to  be  used  in  test. 

3-  XSSI  £iecaU2n 

To  perform  the  test,  after  the  above  entry  of  data 
and  parameters,  type  CRUNCH  while  in  CHS. 

4.  Output  Available  ta  £he  usej 

The  program  CRUNCH  displays  the  output  file  to  which 
the  data  has  been  sent,  e.g.,  "U  LISIING".  The  names  of  the 
data  set  and  variables  used  are  at  the  top  of  the  output 
file.  The  test  statistic  will  also  be  listed. 


C.  ACTIONS  PEBFOBMED  BY  THE  PBOGBAM  CBONCH 


1 .  Description  of  Calculation  of  Test  Statistic 

a.  Calculate  the  Difference  Vector  S 

X  and  Y  are  sorted  into  order  statistics, 
1(1)  ,X(m),  and  Y  (1)  . ,Y  (n)  .  The  counter  for  X  is  i, 

the  counter  for  Y  is  j.  Compare  X(1)  with  Y <  1)  .  If  X(1)  < 
Y(  1)  then  increaent  i;  if  X  (1)  >  Y(1)  then  increment  j.  The 
difference  between  the  empirical  distribution  functions  is 
(i-1)/m  minus  (j-1)/n.  After  each  comparison  of  X  (i)  and 
Y(j)  the  appropriate  counter  is  incremented  and  the  differ¬ 
ence  is  calculated  and  stored  in  the  vector  S.  The  counter 
i  is  incremented  until  it  reaches  (a+1);  j  can  reach  (n+i). 
If  X  (i)  *  Y  (j)  ,  increaent  i,  calculate  and  store  the  differ¬ 
ence,  then  increaent  j,  and  compute  and  store  the  differ¬ 
ence.  There  will  be  (a+n)  elements  in  S. 

b.  Calculate  the  Test  Statistic 

For  a  two  tailed  test  (hypothesis  A),  the 
greatest  absolute  difference  is  the  test  statistic.  For 
hypothesis  B,  the  largest  difference  is  the  tast  statistic. 
For  hypothesis  C,  the  smallest  difference  is  the  test 
statistic. 

2 •  liaafll  Sat&oJ  2l  Qperatlon 

The  subroutine  Sairn  calls  Shsort  to  rank  X,  then 
rank  Y.  It  calls  the  subroutine  Finds  to  calculate  the 
difference  vector  S,  then  chooses  the  test  statistic  based 
on  the  hypothesis  specified. 

3.  £xggna 

a.  Limitations 

Maximum  number  of  data  points:  400.  Maximum 
number  of  variables:  2. 

b.  Subroutines/Procedures  Osed 

(1)  Lochinvar:  Smirnov 

(2)  Crunch:  Sairn,  Finds 

(3)  External:  Shsort 


XXVIII.  CRAMER-VON  MI§ES  TEST 


A.  OVERVIEW 

1 .  General  £ui£ose 

with  two  samples,  X  and  I,  tast  whether  the  under¬ 
lying  population  distributions  F(x)  and  G(y)  could  be  equal 
over  the  entire  range  of  X  and  Y. 

The  Smirnov  and  Cramer-von  Nises  test  differ  only  in 
the  computation  of  the  test  statistic,  and  therefore,  the 
choice  of  hypotheses. 

2.  Level  of  Measurement 

Interval 

3 .  Ag§uae t£sa§ 

The  samplas  are  randomly  drawn  from  the  respective 
populations,  and  are  independent  of  each  other. 

4 .  Hypothesis 

a.  Null  hypothesis 

HO:  F  (X)  *G  (X)  for  all  x 

b.  Alternative  hypothesis 

HI:  F  (x)  >G  (xj  for  at  least  one  x 


B.  ACTION  REQUIRED  BY  USER  OF  PROGRAM 

1  •  User  Decisio ns  about  Ppop^lS.  9Btio£t§ 

If  there  are  more  than  two  variables  in  the  data 
set,  specify  which  two  are  in  the  test. 

2.  Program  Activation  whep  U§iaa  Data  Values 
a.  Initial  Choices 

See  section  III.C  for  initial  choice  details. 

In  CMS,  type  L3C HI.  If  lata  is  already  in  a  data 
file,  enter  "T"  to  first  question,  then  enter  file  name; 
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when  offered  a  test,  type  28.  If  data  needs  to  be  entered, 
enter  "N"  for  first  question,  "V"  for  second;  when  data 
entered,  choose  to  do  a  test;  when  offered  a  choice  of 
tests,  type  28.  Enter  •'  0", "V",. . .  "Z"  for  desired  output 
file. 

b.  Variable  and  Parameter  Choice 
When  prompted  by  Lochinvar: 

Enter  indices  of  variables  to  be  used  in  test., 
if  the  data  set  has  more  than  two. 

3.  Test  Execution 

To  perform  the  test,  after  the  above  entry  of  data 
and  parameters,  type  CRUNCH  while  in  CHS. 

4.  Output  Available  to  the  User 

The  program  CRUNCH  displays  the  output  file  to  which 
the  data  has  been  sent,  e.g.,  nU  LISTING".  The  names  of  the 
data  set  and  variables  used  are  at  the  top  of  the  output 
file.  In  addition,  the  test  statistic  is  listed. 


C.  ACTIONS  PERPORHED  BY  THE  PROG  RAH  CRUNCH 


1 •  Description  of  Calculation  of  Test  statistic 
a.  calculate  the  Difference  vector  S 

This  step  is  exactly  the  same  (using  the  same 
subroutine)  as  in  the  Smirnov  test.  see  section  XXVII  for 
details. 


b.  Calculating  the  Test  Statistic 

Find  the  sum  of  the  squared  elements  of  the  s 
vector,  with  m  as  the  number  of  elements  of  X,  and  n  as  the 
number  of  elements  of  Y,  multiply  the  sum  of  squared  s  (i) 
times  (m*n) /(a*n)  **2 .  This  is  the  test  statistic. 


2.  Program  Method  of  0  peration 


Thesubroutine  Crmvon  calls 
order  Y  into  order  statistics.  The 
the  vector  S.  In  a  do  for  loop. 


Shsort  to  order  X  and 
subroutine  Finds  finds 


Crm  von 


i=  1  to  (m«-n)  , 


That  sua  is  then  multiplied  by 


squares  and  suns  S  (i) . 

(n*n)  /  (a+n)  **2. 

3 .  Program  Characteristics 

a.  Limitations 

Maximum  number  of  data  points: 
number  of  variables:  2. 

b.  Subroutines/Procedures  Used 

(1)  Lochinvar:  Cramvon 

(2)  Crunch:  Crmvon,  Finds 

(3)  External:  Shsort 
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APPENDIX  A 

TUTORIAL:  LINKING  TO  AND  USING  THE  COMPUTER  PACKAGE 

A.  PURPOSE 

This  appendix  contains  three  things:  instructions  on 

hoir  to  link  to  the  class  disk  to  perform  nonparaaetric 
tests;  instructions  on  hovr  to  copy  the  programs  onto  a 

user's  A  disk;  and  an  example  of  how  to  use  the  program. 

\ 

B.  LINKING  TO  THE  CLASS  DISK 

To  perform  the  test,  a  student  can  link  to  user  number 
2266p,  read  only  password  OBRIEN,  and  use  the  programs  there 
to  perform  nonparametric  tests.  The  steps  are: 

1 .  Linking  t£  the  Class  Disk 

In  CMS,  type  "CP  LINK  2266P  191  192  RR".  The 

computer  will  type  back  ENTER  READ  only  PASSWORD.  Enter 
"OBRIEN".  After  receiving  back  the  "R"  message,  enter  ACC 
192  B/A. 

2.  Using  the  Programs 

An  example  of  the  use  of  Lochinvar  and  Crunch  is 
given  belcw.  Detailed  instructions  for  use  are  given  in 
section  III  an d  the  sections  on  specific  tests. 

3 .  Releasing  the  £lass  Disk 

After  you  are  done,  type  RELEASE  192. 

C.  COPYING  THE  PACKAGE 

Three  files  are  needed  to  run  nonparametric  tests: 
"LOCHI  MODULE",  "CSUNCH  TEXT",  and  "CRUNCH  EXEC".  The  first 
two  are  compilations  of  the  source  code,  contained  in  "LOCHI 
PASCAL"  and  "CRUNCH  FORTRAN".  To  copy  any  of  these 
programs: 


1.  Link  to  the  Class  Disk 

2.  Copy  the  File:  Type  XEDIT  <filename>  <filetype>.  Type 
SAVE  *  *  A  in  the  space  narked  ==>  at  the  bottom  of  the 
screen#  then  FILE.  The  file  will  be  on  the  A  disk. 

3.  Release  the  class  Disk 


D.  AN  EXAMPLE  OF  TEST  DS  E 


To  fully  understand  this  example,  read  or  review 
sections  III  and  XXI  of  this  text.  The  data  set  used  is 
from  Conover  [Ref.  1:  p.319];  the  test  used  is  the  Friedman 
test.  The  example  shows  the  program’s  prompting  in  capital 
letters#  and  the  user's  response  in  lower  case,  with  a  <== 
to  the  right. 

1.  Entering  Dat§  File 

a.  Preliminary  Steps 

There  are  four  initial  steps  to  perform  before 
entering  a  data  file.  The  first  is  to  activiate  Lochinvar, 
then  to  request  not  to  use  a  previously  entered  file,  then 
to  enter  the  file  by  data  values,  and  finally  to  write  the 
information  to  data  file  "lochi  a1'. 

lochi  <=*  (1) 

EXECUTION  BEGINS... 

DO  YOU  BANT  TO  PERFORM  A  TEST  ON  A  PREVIOUSLY  ENTERED  DATA  SET? 

ENTER  Y  OR  N 
n  <*»  (2) 

YOU  HAVE  A  CHOICE  OF  TWO  METHODS  TO  ENTER  DATA: 

1.  ENTER  EACH  VALUE  OF  DATA  SET.  VALUES  MUST  BE  NUMERICAL,  AND 

MEASUREMENT  AT  LEAST  ORDINAL.  ALL  TESTS  MAY  BE  PERFORMED  THIS  WAY. 

2.  ENTER  ONLY  THE  COUNT  OF  SUCCESSES /FAILURES  OR  ROW/COLUMN  CELL 

COUNT.  DATA  VALUES  AND  LEVEL  OF  MEASUREMENT  MAY  BE  NOMINAL,  BUT 

COUNTS  MUST  BE  NON-NEGATIVE  INTEGERS.  THE  FOLLOWING  TESTS  MAY  BE 

PERFORMED:  BINOMIAL,  MCNEMAR,  R*C  CONTINGENCY,  CHI-SQUARE 

GOODNESS  OF  FIT. 

ENTER  V  FOR  VALUE  OR  C  FOR  COUNT 
v  <*«  (3) 

ENTER  FILETYPE  (A-F)  TO  WHICH  DATA  GOES 
a  <==  (4) 

b.  Enter  Information  of  Label  of  Data  Set 

The  label  needs  three  types  of  information:  the 

data  set  name,  the  number  of  variables,  and  the  variable 
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naaes.  For  tars  example,  we  are  using  the  data  of  exercize 
1,  page  309  in  Conover,  with  four  variables,  named  the  names 
of  the  seasons: 

EHTEB  NAME  OF  DATA  SET 

prob  1,p  309,  number  of  births  by  hospital  <==  (1) 

ENTER  NUMBER  OF  VARIABLES  OR  TREATMENTS 
4  <=*  (5) 

ENTER  NAME  OF  VARIAELE  NR  1 

winter  <**  (3a) 

ENTER  NAME  OF  VARIABLE  NR  2 

spring  <**  (3b) 

ENTER  NAME  OF  VARIAELE  NR  3 

summer  <*=  (3c) 

ENTER  NAME  OF  VARIABLE  NR  4 

fall  <»*  (3d) 

c.  Enter  Data  Elements 

Next,  enter  the  data  when  prompted.  Enter  either  inte¬ 
gers  or  real  numbers,  i.  e. ,  with  or  without  decimal  points; 
the  program  will  read  them  as  reals.  Leave  a  space  (not  a 
comma)  in  between  numbers.  Enter  all  elements  of  a  variable 
on  one  line  if  possible;  if  not,  there  will  be  a  chance  to 
add  more  elements  after  the  "ENTER"  key  is  pressed.  There 
will  also  be  a  chance  to  check  or  correct  entries.  In  the 
following  example,  a  mistake  is  made,  then  corrected,  in 
entry  of  variable  one.  In  variable  two  the  data  is  checked 
and  found  correct;  in  variable  three  it  is  not  checked.  in 
variable  four,  the  first  four  data  elements  are  listed,  the 
"ENTER"  key  is  pressed,  the  last  three  data  elements  are 
added,  and  the  data  checked. 

(1)  Enter,  Check,  and  Correct  the  Values  of 
Variable  One: 

BHEH  ENTERING  DATA  ELEMENTS  <1,  START  WITH  0. 

ENTER  VALUES  FOR  VARIABLE  1 

92  9  98  19  21  58  43  <=» 

ADD  MORE  ELEMS  TO  SAME  VARIABLE? 

OR  CHECK  THIS  VAR?  OR  GO  TO  NEXT? 

ENTER  A  (ADD), C  (CHECK)  OR  S  (  GO  ON) 

NUMBEB*OF  ELEMS  ENTERED  7 

NUMBER  OF  ELEMS  IN  THIS  VAR  7 

92.000  9.000  98.000  19.000  21.000  58.000  43.000 
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(The  last  entry  should  have  been  '42'  vice  '43’ 
so  re-enter  variable:) 

WANT  TO  RE-ENTER  THIS  VARIABLE? 

ENTER  Y  CR  N 

EnIeR  VALUES  FOR  VARIABLE  1 

92  9  98  19  21  58  42 
ADD  BORE  ELEMS  TO  SAME  VARIABLE? 

OR  CHECK  THIS  VAR?  OR  50  TO  NEXT? 

ENTER  A  (AD  D)  ,  C  (  CHECK)  OR  3(G0  ON) 

C  <sss 

NUMBER  OF  ELEMS  ENTERED  7 

NUMBER  OF  ELEMS  IN  THIS  VAR  7 

92.000  9.000  98.000  19.000  21.  000  58.000  42.000 

WANT  TO  RE-ENTER  THIS  VARIABLE? 

ENTER  Y  OR  N 
n  <== 

(2)  Enter,  Check  Values  for  Second  Variable: 

ENTER  VALUES  FOR  VARIABLE  2 

112  11  109  26  22  71  49  <== 

ADD  MORE  ELEMS  TO  SAME  VARIABLE? 

OR  CHECK  THIS  VAR?  OR  GO  TO  NEXT? 

ENTER  A  (ADD)  ,C  (CHECK)  OR  5  (GO  ON) 

C  <== 

NUMBER  OF  ELEMS  ENTERED  14 

NUMBER  OF  ELEMS  IN  THIS  VAR  7 

112.000  1  1.000  109.000  26.  000  22.000  71.000  49.000 

(This  time,  it  is  right  the  first  time, 
so  no  re_entry:) 

WANT  TO  RE-ENTER  THIS  VARIABLE? 

ENTER  Y  CR  N 
n  <*= 


(3)  Enter  Values  for  Third  Variable: 

ENTER  VALUES  FOR  VARIABLE  3 

99  10  92  19  23  51  44  <=  = 

ADD  MORE  ELEMS  TO  SAME  VARIABLE? 

OR  CHECK  THIS  VAR?  OR  GO  TO  NEXT? 

ENTER  A  (ADD)  ,  C  (CHECK)  OR  G(GO  ON) 

g<aas 

(4)  Enter,  Add  to.  Check  Values  for  Fourth 

Variable: 


,  FOR  VARIABLE 
18  <== 

ADD  MORE  ELEMS  TO  SAME  VARIABLE? 
OR  CHECK  THIS  VAR?  OR  GO  TO  NEXT? 
ENTER  A  (ADD)  ,C  (CHECK)  OR  G(GO  ON) 
a  <== 

ENTER  VALUES  FOR  VARIABLE 
24  62  41  <=  = 

ADD  MORE  ELEMS  TO  SAME  VARIABLE? 
OR  CHECK  THIS  VAR?  OR  GO  TO  NEXT? 
ENTER  A  (ADD),  C(CHECK)OR  G  (  GO  ON) 

NUMBER  OF  ELEMS  ENTERED 
NUMBER  OF  ELEMS  IN  THIS  VAR 


23 


4 


4 
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77.000  12.000  81.000  18.000  24.000  62.000  41.000 

WANT  TO  BE-ENTER  THIS  VARIABLE? 


ENTER  Y  OR  N 
n  <=* 


(5)  Lochinvar  offers  a  chance  to  re-enter  the 
entire  data  set 

YOU  HAVE  ENTERED  4  VARIABLES, 

WITH  THE  FOLLOWING  NUMBER  OF  ELEMENTS  IN  EACH 
VAR  NR  NR  OF  ELEMENTS 

1  7 

2  7 

3  7 

4  7 

DO  YOU  WISH  TO  RE-ENTER  THE  ENTIRE  DATA? 

ENTER  Y  OR  N 
n  <== 


(6)  Lochinvar  Determines  the  Variables  are  of 


equal  size: 


THE  VARIABLES/TREATMENTS  ARE  OF  EQUAL  SIZE: 

PAIRED  COMPARISON  TESTS  ARE  POSSIBLE 

(7)  Choice  to  do  a  Test  with  this  DATA 

NOW.  DO  A  TEST  WITH  THIS  DATA  SET? 

ENTER  Y  OR  N 
y  <== 

2.  Choice  of  Test.  Test  Parameters.  Output  File 
a.  Choice  of  Output  Piles 

The  relevant  choices  are  *U  LISTING', »V 
LISTING' Z  LISTING'.  Anythin?  that  is  already  in  the 
file  will  be  written  over. 

WHICH  NAME  FOR  OUTPUT  FILE,  U-Z? 
u  <== 

t.  Choice  of  Test 

Lochinvar  will  offer  test  according  to  the  number 
of  variables  in  the  data  set  and  whether  the  variables  are 
of  equal  length.  Test  number  17  is  desired;  the  Friedman 
rather  than  the  Durbin  is  chosen. 


PICK  AMONG  THE  FOLLOWING  N ON-PAR AMET RIC  TESTS: 

1.  BINOMIAL 

2.  QUANTILE 

3.  COX-STUART  TEST  FOR  TREND  <2  7ARS,PAIR> 

4.  MANN- WHITNEY 
5. SIGN  <PAIR> 

6.  MCNEMAR  <PAIR> 

7. R*C  CONTINGENCY  T  ABLE  (CH  I-SQUARE)  <PAIR> 

8.  GOODNESS  OF  FIT,  N>14  (CHI-SQUARE) 

9.  WILCOXON  SIGNED  RANK  <?AIR> 

10.  RANK  CORRELATION  (KENDALL  S  SPEARMAN)  <PAIR> 

11.  NON-PARA  METRIC  LINEAR  REGRESSION  <?AIR> 

12.  NON-PARAMETRIC  MONOTONE  REGRESSION  <PAIR> 
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13.  MEDIAN 

14.  KRUSKAL/ WALLIS 

15.  SQUARED  RANK  TEST  POR  EQUAL  VARIANCE 

16.  HARTLEY  TEST  FOR  EQUAL  VARIANCE 

17.  FREIDMAN  DR  DURBIN  <BLO  CKED> 

18. QUADE  <BLOCKED> 

19.  COCHRAN  <BLOCKED> 

20.  KOLMOGOROV (ONE  S AMPLE : PA RAMETERS  SPECIFIED 

21.  LILLIEFORS  (ONE  S  AMPLE;  N  ORMAL  OR  EXPONENTIAL) 

22.  WILKES -SHAPIRO  TEST  FOR  NORMALITY 
23. SMIRNOV  (2-SAMPLE) 

24.  CRAMER-VON  MISES  (2-SAMPLE) 

DO  YOU  WANT  TO  SEE  THE  LIST  AGAIN? 

ENTER  Y  OR  N 
n  <== 

ENTER  THE  NUMBER  OF  A  TEST  LISTED  ABOVE 
17  <== 

THIS  WILL  ALLOW  THE  FRIEDMAN  OR  THE  DURBIN  TEST 
<DURBIN  IS  INCOMPETE  BLOCK  DESIGN:  NEEDS  FILLER> 

DO  YOU  WANT  TO  PERFORM  THE  FRIEDMAN  TEST? 

ENTER  Y  OR  N 
y  <== 

c.  Choice  of  Test  Parameters 

For  this  example,  only  the  first,  second  and 
fourth  variables  are  used  in  the  test,  and  the  third  block 
of  the  data  sat  (i.e.,  the  third  element  of  each  variable) 
is  excluded  from  the  test.  The  final  test  parameter  entered 
is  the  level  of  significance. 


(1)  Variables  Included  in  Test 


WHEN  ENTERING  PARAMETERS  <1.  START  WITH  0. 
DO  YOU  WANT  ALL  4  INCLUDED  IN  TEST 
ENTER  Y  OR  N 
n  <=* 

HOW  MANY  VARIABLES  ARE  INCLUDED  IN  TEST? 

3  <*= 

ENTER  INDICES  OF  VARIABLES  TO  BE  IN  TEST 
ENTER  INDEX  OF  INCLUDED  VARIABLE  1 
1  <== 

ENTER  INDEX  OP  INCLUDED  VARIABLE 
2  <=* 

ENTER  INDEX  OF  INCLUDED  VARIABLE 

4  <** 


2 

3 


(2)  Exclude  Third  Block 


DO  YOU  WISH  TO  EXCLUDE  ANY  BLOCKS  FROM  TEST 
ENTER  Y  OR  N 

HO§  MANY  BLOCKS  DO  YOU  WISH  TO  EXCLUDE? 

UP  TO  7  BLOCKS 
1  <** 

IN  ASCENDING  ORDER,  ENTER  INDICES,  1  -  7 

ENTER  INDEX  FOR  EXCLUDED  BLOCK  NR  1 
3  <»« 


(3)  Specify  Level  of  Significance 


ENTER  LEVEL  OF  SIGNIFICANCE,  ALPHA 
ENTER  NUHBER>0,  IF  <1,  STA RT  WITH  0. 
0.05 


3.  £u£  with  Program  Crunch 

In  CHS,  type  CRUNCH 

4.  check  output  File 

Crunch  will  ccmput  the  test  statistic , than  display 

THE  RESULTS  ARE  LISTED  IN  FILE 
U  LISTING 

To  check  the  results,  type  XEDIT  D  LISTING 


APPENDIX  3 

THE  PB03RAM  LOCHINVAR 


A.  Purpose 

This  appendix  is  intended  to  assist  in  modifying  the 
program  Lochinvar,  or  in  implementing  it  on  a  computer 
system  other  than  the  IBM-3033  at  the  Naval  Postgraduate 
School.  The  last  part  of  this  appendix,  general  description 
of  program  operation,  may  be  of  interest  to  a  user  who  is 
experienced  in  the  use  of  Lochinvar. 

Section  III  of  the  text  describes  the  general  use  of  the 
program,  while  Sections  V  through  XXVIII  give  specific 
instructions  on  responding  to  Lochinvar* s  promptings  for  the 
various  tests.  Appendices  C  and  D  explain  in  detail 
Lochinvar's  two  output  files,  the  data  file  and  the  options 
file,  respectively. 

The  source  code  is  contained  in  Appendix  H. 

B.  DESCRIPTION  OP  PBOGBAM  CH ABACTEBISTICS 

i  -  iasajaas 

Lochinvar  was  originally  written  and  tested  in 
Waterloo  Pascal;  it  is  presently  in  a  form  compatible  with 
the  IBM  virtual  system  Pascal  compiler,  PASCALVS.  The 
program*s  syntax  is  consistent  with  the  Pascal  described  by 
Zaks  [Ref.  3]. 

2.  Input  and  Output  Files 

In  Lochinvar,  input  or  output  files  are  referred  to 
by  one  of  nine  t9Xt  definitions.  "Qin"  refers  to  terminal 
input;  "gout"  refers  to  terminal  i~put;  "op"  refers  to  the 
options  file,  "lochi  op".  Text  "a"  through  "f"  refer  to 
data  files  "lochi  a"  through  "lochi  f";  these  data  files  are 
used  both  as  input  and  output. 


Prior  to  using  a  file  for  output,  Lochinvar  must 
open  that  file  by  the  "rewrite"  command.  For  the  data 
files,  this  occurs  in  the  procedure  New_data;  for  the 
options  file,  it  occurs  in  the  procedure  File_io.  All  seven 
of  those  files  are  explicitly  defined  as  fixed  format, 
logical  record  length  80.  The  form  of  the  rewrite  command, 
here  illustrated  for  "lochi  a",  is: 

REWRITE  (A,*  NAME  *  LD  CHI .  A  .  A,  RECFM=F, LB ECL  =  80  • ) 

The  terminal  output  is  opened  in  the  main  program, 
using  the  command  "TERHOOT  (QOOT) 

Prior  to  reading  a  file  as  input,  Lochinvar  must 
open  that  file  for  reading  with  the  "reset"  command.  For 
the  data  files,  this  occurs  in  the  procedure  Check_file. 
The  terminal  input  file  must  be  reset  prior  to  each  time 
data  is  entered  from  the  keyboard.  Bather  than  having  reset 
commands  scattered  throughout  the  program,  the  procedure  Rst 
is  invoiced.  Rst  resets  the  terminal  for  input  with  the 
command  "TEBBIN  (QIN)  ". 

3.  Dimensions  c>.£  Structures 

At  the  beginning  of  Lochinvar  are  six  constants. 
Tot_data  defines  the  total  number  of  data  elements  contained 
in  the  data  set;  it  is  currently  sat  at  400.  Tot_vars  is 
the  maximum  number  of  variables  in  a  data  set  plus  one;  it 
is  currently  set  at  16  (for  15  variables) .  L_names  is  the 
length  of  variable  or  data  set  names;  it  is  currently  set  at 
50.  L_r_par,  L_i_par,  and  L_e_b  are  the  number  of  real 
parameters,  the  number  of  integer  parameters,  and  number  of 
excluded  blocks,  respectively;  each  is  currently  set  at  50. 

Changing  the  setting  of  any  one  of  those  numbers 
will  change  all  subsequent  dimensioning  within  the  program. 
It  is  not  recommended  that  L_names  be  increased  beyond  80, 
since  this  would  place  names  on  more  than  one  line  (card 
image)  of  the  data  or  options  file. 
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4.  Type  ajjd  Variable  Declarations 

Below  the  constant  declarations,  the  type  declations 
define  arrays,  sets  and  types.  Below  that  are  variable 
declarations,  A  11  parameters  are  defined  globally. 

5.  Description  Ii£la bin,  Sets 

Below  the  variable  declarations,  a  comment  section 
gives  brief  descriptions  of  each  variable,  set  or  array 
passed  as  a  parameter. 

6.  Description  of  Procedures  aa£  Functions 

Each  subprogram  element  contains  a  brief  comment 
section  describing  its  purpose. 

C.  DESCRIPTION  OF  PBOGRAH  OPERATION 

The  main  program  offers  the  user  three  choices:  use  a 
previously  enter  data  set  to  perform  a  test,  enter  a  new 
data  set,  or  enter  cell  counts.  See  Section  III  of  the  text 
for  a  complete  explanation  of  these  choices.  Depending  on 
the  user's  choice,  the  sain  program  will  call  one  of  three 
master  subprograms  to  prompt  the  user  through  the  data  and 
parameter  entries: 

1 .  New  Data 

The  master  subprogram  New_Data  prompts  entry  of  a 
new  data  set  and  offers  the  user  the  chance  to  perform  a 
test  with  the  newly  entered  data.  Steps  in  this  process: 

a.  Data  Set  File  Type 

New_data  asks  the  filetype  for  the  data,  "a" 
through  "fH. 

b.  Names  and  Number  of  Variables 

New_data  calls  Namer,  which  asks  the  data  set 
name,  the  number  of  variables,  and  the  variable  names. 

c.  Data  Element  Values 

If  there  are  two  variables,  New_data  offers  the 
choice  of  entering  data  pair  by  pair.  If  the  user  chooses  to 


do  this,  New_data  calls  E_pairs.  Otherwise,  the  procedure 
E_by_var  prompts  entry  of  data  variable  by  variable. 

d.  Writing  of  Data  File 

Hev_data  writes  the  data  sen  file. 

e.  Test  Choice 

New_data  asks  if  a  test  should  be  performed  with 
this  data.  If  yes,  File_io  and  (ihichjtest  are  called;  these 
procedures  are  described  in  the  next  paragraph. 

2.  Old  data 

If  the  user  indicates  that  a  previously  entered  data 
set  is  to  be  used,  the  procedure  01d_data  is  called.  It 
performs  the  following: 

a.  Data  File  Choice 

old_data  asks  the  user's  choice  of  file  type  of 
old  data,  "lochi  a"  through  "lochi  f".  01d_data  calls 
Chech^file,  which  read  and  displays  the  name  of  the  indi¬ 
cated  data  set.  If  it  is  the  correct  data  set,  it  also 
reads  the  number  of  variables  and  number  of  elements  per 
variable. 

b.  Output  File  Choice 

The  procedure  File_io  asks  to  which  output  file 
the  test  results  should  be  sent.  It  writes  the  number  of 
the  data  set  file  (HIF)  and  the  number  of  the  test  results 
file  (HOF)  to  line  one  of  the  options  file. 

c.  Test  Selection  and  Parameter  Choice 

01d_data  calls  the  procedure  Which_test.  This 
procedure  offers  some  or  all  of  the  nonparametric  tests, 
depending  on  the  number  of  variables  in  the  data  set  and 
whether  the  number  of  elements  per  variable  are  equal.  Once 
the  choice  is  made,  Which_test  writes  that  choice  to  line  2 
of  the  options  file,  along  with  whether  the  test  involves 
cell  counts.  Which_test  then  calls  the  procedure  for  the 
chosen  test. 
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d.  Test  Variable  and  Parameter  Selection 

Each  test  procedure  is  somewhat  different, 
however,  there  are  common  points.  If  the  test  involves 
cells,  the  test  procedure  prompts  cell  related  parameters, 
then  calls  W_cells  to  write  those  parameters  to  line  3  of 
the  options  file  (see  Appendix  D  for  details  of  that  file)  . 
If  the  data  set  contains  more  variables  than  the  minimum 
needed  for  the  test,  Hhich_vec  will  prompt  the  user  to  enter 
which  variables  are  included  in  the  test.  If  the  test 
involves  paired  or  blocked  data,  the  procedure  Exclud_b 
offers  the  user  the  chance  to  exclude  pairs  or  blocks  from 
the  test;  in  other  cases,  None_ex  writes  a  "0"  to  line  5  of 
the  options  file.  The  test  procedure  then  prompts  entry  of 
various  integer  and  real  parameters,  and  calls  the  procedure 
S_param  to  write  those  parameters  to  lines  6  and  7  of  the 
options  file. 

3 .  Cel  test 

If  the  user  specified  to  enter  cell  counts,  the 
procedure  Cel_test  is  called  by  the  main  program.  It  calls 
File_io  to  write  the  NIF  and  NOF  to  the  data  file.  It  calls 
Maser  to  get  the  data  set  and  variable  names.  The  choice 
test  dictates  which  procedure  prompts  the  cell  count  entry. 
Cel_test  writes  the  data  set  name  aid  variable  names  to  line 
8  of  the  options  file. 


APPENDIX  C 
THE  DATA  PILE 


A.  PURPOSE 

This  appendix  explains  the  structure  of  the  data  files, 
which  contain  the  data  set  and  variable  names,  the  number  of 
variables  and  number  of  elements  per  variable,  and  the  data 
elements  of  the  variables.  This  will  aid  the  user  in  under¬ 
standing  the  computer  package.  More  importantly,  it  will 
allow  the  user  to  test  data  generated  by  another  program,  or 
read  from  cards  or  tape,  without  having  to  load  the  data 
into  Lochinvar  by  hand. 

B.  METHOD  OF  WRITING  DATA  FILES 

1.  Formats  of  Data  File  Components 

All  naaes  are  written  in  character  fields  of  length 
48,  from  the  first  to  the  forty-eighth  position  on  a  line. 
All  integers  are  written  in  a  1615  format,  i.  e. ,  with  the 
least  significant  digit  of  the  integer  on  position 
5, 10,. ..,80.  All  data  elements  are  read  as  real  numbers; 
they  are  written  to  be  read  in  an  8P10.3  format.  This  means 
that  there  are  eight  fields  for  real  numbers  per  line, 
1-10, 11-20,. ..  ,71-80 ,  and  the  real  numbers  have  three  digits 
to  the  right  of  the  decimal  point,  e.g.,  789.123.  Lochinvar 
includes  a  decimal  pcint  in  all  real  numbers;  if  this  is  not 
possible,  the  least  significant  digit  of  an  entry  (the  0.001 
digit)  must  be  right  justified  in  the  field,  i.e.,  aligned 
on  column  10,20,..  .,80. 

If  the  user  has  a  lata  set  whose  data  elements  are 
all  integers,  there  is  a  way  to  have  the  data  file  written 
without  converting  all  the  integers  to  real  numbers.  The 
least  significant  digit  of  the  integers  can  be  aligned  on 


columns  7,17,.  ..,77,  and  tha  program  Crunch  will  read  than 
as  real  numbers  with  tha  sama  value  as  original  integer 
data. 

If  tha  data  elements  of  tha  data  set  are  either  all 
very  large  or  all  very  small,  they  may  not  fit  into  the 
F10.3  format.  &  quick  fix  to  that  problem  is  to  rescale  the 
values,  e.g.,  multiply  very  small  numbers  by  1000,  or  divide 
large  numbers  by  that  amount.  Tha  validity  of  the  statis¬ 
tics  will  not  be  effected  by  that  transformation. 

If  thare  are  both  very  large  and  very  small  numbers 
in  the  data  set,  there  will  be  numerical  errors  in  single 
precision  Fortran  for  any  tests  which  use  an  interval  level 
of  measurement.  For  tests  which  use  ordinal  or  nominal 
level  of  measurement,  any  monotonic  transformation,  such  as 
y*ln  x,  could  be  used  to  allow  both  very  large  and  very 
small  data  values  to  fit  into  the  F13.3  format. 

2.  Lavcut  of  the  Data  File 

a.  Data  Set  name 

The  data  sjet  name  occupies  the  first  line  of  the 
data  file.  Phe  name  is  from  1  to  43  characters  long.  On 
the  same  line  as  the  name,  in  the  forty-ninth  position, 
thare  is  the  character  M1"»  which  on  IBM  keyboards  is  the 
upper  case  "1“.  For  example,  a  data  set  file  name: 

This  is  a  data  set  name  1 

b.  Number  of  Variables  and  Elements  per  Variable 

On  the  line  following  the  data  set  name,  there  is 
a  line  of  integers.  The  first  integer  is  the  number  of 
variables  in  the  data  set.  The  second  integer  is  the  number 
of  elements  in  the  first  variable;  the  third  integer  is  the 
number  of  elements  in  the  second  variable,  and  so  on.  There 
must  be  non-zero  integers  in  the  first  two  positions. 
Lochinvar  writes  zero  where  there  are  no  values,  but  rhat  is 
not  necessary.  As  mentioned  above,  integers  are  right 
justified  on  position  5, 10,..  .,80. 


Let  the  data  set  consist  of  two  variables.  The 
first  variable  has  two  data  elements,  the  second  has  four, 
the  first  two  lines  would  appear  as: 

This  is  a  data  set  name  i 

2  2  4  0  0  0  00... 

c.  Data  elements 

The  data  elements  are  listed  variable  by  variable, 
i.e.,  all  the  elements  of  the  first  variable  are  listed, 
then  all  cf  the  second,  et  cetera.  As  mentioned  above,  the 
data  elements  are  read  as  real  numbers  in  the  F10.3  format. 

Let  the  elements  of  the  first  variable  be  27.5  and 
31;  let  the  elements  of  the  second  variable  be  13,  17,  25 

and  35.  The  first  three  lines  of  the  data  file  would  appear 
as: 

This  is  a  data  set  name  I 

2240000000  0... 

27.500  31  .000  13.  000  17.  000  25.000  35.  000 

The  zeros  to  the  right  of  the  decimal  point  are 
written  by  Lochinvar  but  are  not  necessary.  The  number 
could  have  been  written  without  decimal  points,  with  proper 
alignment,  as  shown  below.  However,  when  numbers  are 
writtan  with  decimal  points,  a  mistake  in  alignment  is  no 
problem;  without  decimal  points,  alignment  is  crucial. 
Using  only  alignment  for  data  elements: 

This  is  a  data  set  name  I 

2  2  4  0  0  0  0  0  0  0  0... 

275  31  13  17  25  35 

d.  Variable  names 

The  variable  names  begin  on  the  line  directly 
following  the  last  data  element  line.  Each  variable  name 
occupies  a  line;  there  must  be  the  same  numoer  of  lines  for 
variable  names  as  there  are  number  of  variables  indicated  in 
line  two. 
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Let  freshman  be  the  name  of  the  first  variable  and 
senior  be  the  name  of  the  second. 

This  is  a  data  set  name  I 

2  2  4  0  0  0  0  0  0  0  0... 

27.500  31  .000  13.  000  17.000  25.000  35.  000 

freshman 
senior 

C.  LOCATION  ON  THE  USER'S  A  DISK 

1 .  Default  I(Oca  tion 

As  mentioned  repeatedly  throughout  the  text,  by  default 
the  data  files  are  named  "lochi  a", . .  ."lochi  f".  Lochinvar 
writes  files  to  those  six  locations;  Lochinvar  will  not  be 
easily  modified  unless  the  user  is  experience  in  programming 
Pascal. 

2.  Reading  Files  from  other  than  Default  Locations 
The  program  crunch  reads  "lochi  a" lochi  f"  based 

on  the  file  definitions  in  the  file  "CRUNCH  EXEC"  and  the 
number  of  input  file  (SI?)  of  the  options  file.  "CRUNCH 
EXEC"  is  explained  in  Appendix  E,  while  the  input  file  is 
explained  in  Appendix  D.  'To  have  Crunch  read  from  a  prop¬ 
erly  written  data  file  named  anything  other  than  "lochi 
a" , . . . , "lochi  f",  the  user  will  have  to  do  the  following: 

a.  Write  an  options  file  for  the  data  (see  section 
III  and  Appendix  D)  ,  either  manually  (not  recommended)  ,  or 
else  using  Lochinvar,  specifying  a  data  set  with  the  same 
general  c  la rac t eristics  as  the  test  data  set.  For  example, 
if  the  test  data  set  is  named  QUEUE  DATA,  with  four  vari¬ 
ables  of  equal  length,  tell  Lochinvar  to  run  the  desired 
test  on  data  set  "lochi  a",  given  that  "lochi  a"  has  four 
variables  of  equal  lenghth 

b.  Since  the  number  of  input  file  written  in  the 
options  file  is  "1",  corresponding  to  the  default  location 
"lochi  a",  change  the  file  definition  in  the  file  "CRUNCH 
EXEC".  The  origin  file  "CRUNCH  EXEC", 


STR ACE 

&ERROR  SEXIT  99 
FILEDEF  01  DISK  LOCHI  A 

LOAD  CRUNCH  (START 

becomes, 

STR ACE 

SERROR  SEXIT  99 

FILEDEF  01  DISK  QUEUE  DATA 

LOAD  CRUNCH  (START 

c.  Alternate  Method 

It  would  be  easier,  in  the  example  above,  to 
rename  the  current  file  "lochi  a11  to  some  other  name,  and 
rename  "QUEUE  DATA"  to  "lochi  a". 
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APPENDIX  D 
THE  OPTIONS  FILE 


A.  PURPOSE 

This  appendix  describes  the  options  file.  The  twenty 
nonparametric  tests  which  require  ordinal  or  interval  data 
level  of  measurement  all  have  options  files  with  similar 
format.  The  four  tests  which  can  handle  nominal  data  have 
slightly  idiocy ncratic  formats,  which  contain  all  the  infor¬ 
mation  of  other  test  plus  something  additional. 

This  appendix  is  included  for  completeness  of  documenta¬ 
tion  of  this  text.  It  is  not  recommended  that  a  user 
by-pass  Lochinvar  to  write  the  options  file  by  hand.  If  the 
computer  package  is  implemented  on  an  operating  system  which 
cannot  run  Pascal,  though,  that  may  be  necessary. 


B.  GENERAL  INFORMATION  ABOUT  THE  OPTIONS  FILE 
1 .  Naje 

Lochinvar  writes  the  options  file  to  "lochi  op”  on  the 
user's  A  disk. 

2.  General  F or ma t  Rules 

As  in  the  data  file,  all  integers  are  written  in  the 
format  1615,  all  real  numbers  are  written  as  8F10.3,  all 
names  are  in  48  character  lines.  See  Appendix  c,  paragraph 
B.  ,  for  all  the  different  ways  to  have  real  numbers  repre¬ 
sented  for  the  program  Crunch  to  read. 

3.  Explanation  of  the  Options  File  Format 
a.  List  of  Line  Numbers 

There  are  14  line  numbers:  1,  2,  3,  3a,  4,  4a,  5, 
5a ,  6,  6a,  7 ,  7a ,  8,  8a. 
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b.  Lina  Numbers  in  the  Option  File 

Each  of  the  14  line  numbers  occupies  at  least  cne 
line  (or  card  image)  in  the  options  file,  unless  that  line 
is  omitted.  There  is  not  a  blank  card  for  omitted  line 
numbers;  rather,  the  previous  line  is  followed  by  the  next. 
For  example,  if  line  numbers  3  and  3a  are  ommited,  line  4 
immediately  follows  line  2. 

The  lines  with  "a”  in  their  number  may  occupy  more 
than  one  line  (or  card  image)  in  an  actual  file,  because 

they  may  contain  more  information  than  would  fit  on  a  single 

80  column  card. 

c.  Rules  for  Inclusion  of  Lines  in  Options  File 

Lines  1  and  2  are  included  in  every  options  file. 

Lines  3  and  3a  are  omitted  unless  the  test  is  one 
of  the  four  which  handle  nominal  data. 

Lines  4  and  4a  will  be  included  in  every  test 
except  when  the  user  has  entered  cell  counts  (in  one  of  the 
four  tests  which  use  nominal  data)  .  Line  4  is  a  single 

positive  integer  value  which  tells  how  many  values  to  read 

in  line  4a. 

Line  5  equals  "O'*  and  line  5a  is  omitted  unless 
the  user  has  excluded  blocks  of  the  data  set  from  being 

tested.  In  that  case,  line  5  indicates  the  number  of 

elements  to  be  read  in  line  5a. 

Lines  6  and  7  are  included  in  every  test,  and 
will  contain  single  integer  values.  If  a  value  of  "0"  is  in 
one  of  those  lines,  then  the  corresponding  "a"  line  will  be 
omitted.  For  example,  if  line  6  has  a  "O'1,  then  line  6a 
will  be  omitted.  If  the  single  value  in  lines  4,  5,  6,  and 
7  is  net  "O'',  it  must  be  a  positive  integer,  representing 

the  number  of  elements  to  be  read  in  the  corresponding  "a" 

line.  For  example,  if  the  single  value  in  line  5  is  •• 3” , 
there  will  be  three  elements  to  read  in  line  5a. 


The  "a"  lines  may  occupy  as  many  actual  lines  ir. 
the  data  file  to  contain  all  the  values  to  be  read. 

Lines  8  and  8a  are  used  only  when  the  user  has 
entered  cell  counts  for  a  test  involving  nominal  data, 
d.  Content  of  the  Lines 

(1)  Line  1:  the  number  of  input  file  (NIF)  ,  and 
number  of  output  file  (NOP)  ,  both  integers 

(2)  Line  2:  the  number  of  the  requested  test 
(NTEST)  ,  and  whether  the  test  involves  cell  counts  (CELLS)  , 
both  integers,  CELLS=1  if  the  test  is  one  of  the  four  which 
can  use  nominal  data. 

(3)  Line  3:  the  cell  counter  option  (TTPCEL)  ,  the 
number  of  rows  in  the  cell  matrix  (NROtt)  ,  the  number  of 
columns  in  that  matrix  (NCOL)  ,  and  the  number  of  elements  in 
line  3a  (NTORO)  ,  all  positive  integers.  Depending  on  the 
TTPCEL  option,  line  3a  will  contain  either  integers  or 
reals. 

(4)  Line  4:  the  number  of  treatments  to  be 

included  in  this  test  (NTRBAT)  ,  a  positive  integer.  Line  4a 
contains  the  indices  of  the  variables  of  the  data  file  which 
are  included  in  the  data  set  to  be  tested. 

(5)  Line  5:  the  number  of  blocks  excluded  from  the 
test  (NEXB) .  Line  5a  contains  the  indices  of  excluded 
blocks. 

(6)  Line  6:  the  number  of  integer  parameters,  a 
non-negative  integer.  If  line  6  is  greater  than  zero,  line 
6a  contains  the  integer  parameters  for  the  test. 

(7)  Line  7:  the  number  of  real  parameters,  a  non¬ 
negative  integer.  If  line  7  is  greater  than  zero,  line  7a 
contains  the  real  parameters  for  the  test. 

(8)  Line  8:  data  set  name.  Line  8a  contains  the 
names  of  data  set  variables. 


C.  EXAMPLE  OF  OPTIONS  FILE  FOR  ORDINAL  OR  INTERVAL  TEST 


Table  5  contains  a  representative  data  file,  with  the 
line  numbers  of  the  options  file  for  the  twenty  tests  which 
require  ordinal  or  interval  data.  This  example  is  for  the 
test  to  perform  the  problem  contained  in  Appendix  A,  i. e. , 
problem  1,  page  309  of  Conover  [raf  1  ].  In  this  example, 
three  of  four  variables  were  used,  and  one  block  excluded. 
Line  number  are  enclosed  by  <  >.  Comments  are  enclosed  by 
(*-*)  ;  the  comments  refer  to  the  names  of  variables,  as 
mentioned  in  the  previous  section. 


TABLE  5 
0  ptions 


<1> 

1 

11 

[*NIF=1  ,NOP=1 1*) 

<2> 

<4> 

17 

3 

0 

♦NTEST  =17, C ELLS® 0*) 

♦NTREAT*3*) 

<4a> 

1 

2 

4  (^indices  of  treatments*) 

<5> 

<5a> 

<6> 

<7> 

<7a> 

1 

3 

0 

1 

0.95 

♦NEXB*) 

♦index  of  the  block  excluded*) 

♦number  of  intager  parameters=0*) 
♦number  of  real  parameters=1  *) 

♦real  parameter*!).  95*) 

APPENDIX  E 
THE  PROGRAM  CRUNCH 


A.  PURPOSE 

This  appendix  is  intended  to  assist,  implementing 
Crunch  cn  systems  ether  than  the  IBH-3033  at  the  Naval 
Postgraduate  School.  It  will  also  assist  in  modifying  the 
program,  or  adapting  portions  of  it  for  use  in  other 
programs. 


B.  DESCRIPTION  OF  THE  PROGRAM  STRUCTURE 

1 .  General 

While  Lochinvar  is  designed  to  be  used  only  as  an 
integral  unit.  Crunch  is  separable.  Therefore,  many  things 
which  are  centralized  in  the  main  program  of  Lochinvar--such 
as  dimensioning  and  variable  declaration — are  done  in  each 
subroutine  of  Crunch.  Explanations  and  documentation  are 
also  at  the  subprogram  level. 

2 .  Subprogram  Groups 

Crunch  can  be  thought  of  as  a  system  of  five  groups, 
each  made  up  of  several  subroutines  and  functions. 

a.  Main  Program 

The  main  program,  together  with  the  subroutines 
Fname  and  Vname,  fora  one  group;  they  read  the  data  set  and 
variable  names,  call  the  correct  data  formation  group,  then 
call  the  correct  test  subroutine,  and  remind  the  user  which 
file  contains  the  output. 

b.  Data  Formation  Group  for  Cell  Counts 

If  the  test  involves  call  count  (Binomial, 
Quantile,  McNemar,  Test  for  Independence,  Cochran),  the  data 
formation  group  Caller  is  called  by  the  main  program.  Cellar 
and  the  subroutines  it  calls  produce  the  cell  counts  used  as 
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the  raw  material  for  calculations  in  the  test  subroutine. 
Those  cell  counts  are  contained  in  the  vector  RCCT,  which  is 
dimensioned  by  the  number  of  rows  NROW  and  the  number  of 
columns  NCOL. 

c.  Data  Formation  Group  for  Data  Values 

For  all  other  tests,  the  master  reader  group  is 
called.  Mstrdr  and  its  subroutines  read  the  specified  data 
file  and  the  options  file.  They  produce  a  list  of  the  data 
elements  which  are  relevant  for  the  test,  i.e.,  Mstrdr 
includes  only  the  data  set  variables  indicated,  and  excludes 
any  blocks  or  pairs  specified.  These  data  values  are 
contained  in  the  vector  LIST.  LIST  is  dimensioned  by  the 
pointer  vector  LPTR.  LPTR  (1)  is  always  1.  LPTR(i)  is  the 
index  of  the  first  element  of  the  ith  variable  in  LIST. 
With  k  variables,  LPTR(k  +  1)  is  one  more  than  the  last 
elament  of  LIST.  For  example,  if  LIST  contains  three  vari¬ 
ables,  with  two,  four  and  seven  elements  each,  LPTR  will 
have  four  elements,  (1,3,7,14).  In  dereferencing  variable 
values,  the  ith  variable  will  have  elements  LIST (LPTR(i) ) 
through  LIST  (LPTR(i+  1) -1) .  N TREAT  is  the  number  of  vari¬ 
ables  included  in  the  test. 

d.  Utility  Subroutines  and  Functions 

Many  subroutines  are  called  by  more  than  one  test 
subroutine.  These  versatile  subroutines  are  grouped 
together. 

e.  Test  subroutine  Groups 

Each  of  the  24  tests  has  a  principle  subroutine 
called  by  the  main  program.  In  general,  these  subroutines 
perform  the  bulk  of  the  computations  for  the  test  statistic 
or  level  of  significance.  However,  some  of  the  computation 
is  done  by  calling  other  subroutines.  Subroutines  which  are 
called  only  by  one  test  are  listed  contiguous  to  that  test's 
subroutine.  Subroutines  which  are  called  by  more  than  one 
test  are  listed  in  the  utility  subroutine  group. 


The  test  subroutines  make  use  of  either  LIST  or 
BCCT  in  tfceir  computations. 

3 .  nagjiass 

Crunch  is  written  to  be  compatible  with  the  IBM 
Fortran  H  compiler.  Only  the  most  rudimentary  syntax  was 
used.  The  only  loop  statements  are  the  do  for  loops  of  the 
form, 

DO  10  I*LL,tJL 

where  10  is  a  line  label  for  a  continue  statement,  ll  is  the 
lower  limit  and  UL  is  the  upper  limit  of  the  loop.  All 

ether  loeps  or  branching  is  done  with  if  statements.  This 

very  primitive,  unstructured  Fortran  should  be  compatible 
with  virtually  any  computer  system. 

4 .  Input/Output  Files 

The  IBB  file  definitions  are  contained  in  the  file 

"CRUNCH  EXEC".  All  files  except  08  (terminal)  are  implic¬ 

itly  fixed  format,  logical'  record  length  80.  Files  01 
through  05  refer  to  "lochi  a"  through  "lochi  e" ;  file  06  is 
"problem  data";  file  07  is  "lochi  f";  file  10  is  "lochi  op"; 
files  11  through  16  are  "u  listing"  through  "z  listing". 

The  number  of  input  file  (NIF)  and  number  of  output 
file  (HOF)  are  listed  in  line  one  of  the  options  file,  and 
read  by  Crunch's  main  program.  Files  01  , . . .  ,05, 07,  and  10 
are  candidates  for  HIF;  files  11,... ,16  are  candidates  for 
HOF.  File  08  is  used  only  to  remind  the  user  of  which  output 
file  contains  the  test  result.  File  06  contains  system 
error  messages,  while  file  09  contains  intermediate  computa¬ 
tions  of  Crunch. 

The  main  program  reads  lines  one  and  two  of  the 
options  file.  Either  Celler  or  Mstrdr  read  most  of  the  rest 
of  the  data  and  options  file.  Fname  reads  the  data  set  name 
and  writes  it  to  the  output  file;  Vname  does  the  same  for 
the  variable  names. 
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The  test  subroutines  write  all  of  the  test  results 
files  "u  listing"  through  "z  listing",  except  the  data  set 
and  variable  names. 

5*  Dimensions  and  Peel arations 

it  the  beginning  of  the  main  program,  and  of  each 
subroutine,  the  variables  used  are  declared  and  the  arrays 
dimensioned.  All  declarations  and  dimensioning  is  explicit. 
Although  arrays  could  be  dimensioned  by  variables  in  subrou¬ 
tines,  this  is  not  done.  For  example,  with  TOTELM  as  the 
integer  representing  the  total  data  elements  of  the  test, 
LIST  could  be  dimensioned  in  a  test  subroutine  as 
LIST  (TOTELM)  vice  LIST  (40  0).  This  was  not  done;  as 
explained  above.  Crunch  is  intended  to  be  separable.  To 
change  the  capacity  of  the  program,  a  global  change  command 
may  be  issued,  if  the  installation  has  a  good  text  editor. 
Otherwise,  the  dimensions  must  be  changed  one  by  one. 

There  are  a  very  few  arrays  which  consume  inordinate 
amounts  of  storage  fer  what  information  they  provide.  They 
have  the  dimension  of  20,00  0;  that  dimension  allows  storage 
of  a  comparison  of  each  of  400  data  elements  with  each  other 
data  element.  On  the  IBM- 3033,  these  large  arrays  require 
the  user  to  request  1  megabyte  of  storage.  On  smaller 
systems,  these  arrays  may  be  redimensioned  down  to  exactly 
the  size  necessary  (if  k=the  number  of  elements,  k(k-1)/2). 
An  alternative  is  to  eliminate  them  altogether,  and  take  out 
test  procedures  which  use  them. 

6.  Description  of  Yari ables 

At  the  beginning  of  the  program,  there  is  a  brief 
description  of  all  variables  and  arrays  which  are  used  or 
passed  by  the  main  program.  After  the  subprogram  title 
line,  a  similar  list  defines  the  variables  used  exclusively 
among  subroutines  of  that  subprogram. 
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Above  the  variable  declarations  in  each  test  subrou¬ 
tine,  there  is  a  description  of  each  of  the  integer  and  real 
parameters  used  in  that  test. 

C.  EXTERNAL  SUPPORT 

The  non-INSL  subroutine  Shsort  is  essential  to  the  oper¬ 
ation  of  the  program  Crunch.  Its  documentation  is  available 
to  any  user  at  the  Naval  Postgraduate  School.  Any  system 
sort  routine  can  be  used  to  support  Crunch,  which  is 
currently  configured  to  pass  a  real  vector  of  values  to  be 
sorted,  an  integer  vector  (1,...,n)  to  receive  the  order  of 
the  original  data  values,  and  the  number  of  elements  n. 

The  non-IHSL  Dtest  was  used  for  the  Mann-Whitney  test. 
Its  value  is  that  it  not  only  computes  the  test  statistic, 
but  also  the  level  of  significance.  Modifications  to 
Crunch  could  easily  allow  computation  of  the  test  statistic. 
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APPENDIX  F 

FORTRAN  SOURCE  CODE:  CRUNCH 


THIS  PROGRAM  TAKES  DATA  FROM  TEST  A-F  FILE ,  INFORMATION 
FRCM  TEST  OF  FILE  TC  P  ERF  CRM  NON-P  ARAM  ETRI  C  TESTS 


SUBPROGRAM  GROUPS 


MAIN 


CELLER 


MSTRCR 


UTIL ITY 


TESTS 


VARS 

NIF 

IHOLD 

AVARS 

NTREAT 

TREAT 

NELMS 


TOTELM 

NTEST 

EXB 

NEXB 

IP 

IP1 


NIP 

RP 

NRP 

MF 

NOF 

LIST 

LPTR 


RNVARS 

RNELMS 

OICOP 


CONTAINS  THE  MAIN  PROGRAM,  FNAME ,  VNAME  »  REMIND 
READS  TOP  2  LINES  OF  OPTIONS  FILE,  WRITES  THE 
TCP  OF  OUTPUT  FILE,  CALLS  CELLER  CP  MSTROR, 

TEST  SLBRQUTI  NEi  REMINDS  WHICH  IS  CUTPUT  FILE 
CONTAINS  CELLER  ,  CQCH  ,TC  3.  TC3LN ,  C  ICO  T  ,TC4,TC4LN  , 
PAIRST  .MCOUNT.  FINDS  THE  CELL  COUNTS  FCR  FIVE 
NP  TESTS:  BIM  OMI AL  ,QUANT I  LE , MCNEMAR, COCHARAN  , 
ANC  RC  CONTINGENCY.  CELL  COUNTS  TO  RCCT; 
DIMENSIONED  BY  NROW  AND  NCOL 
CONTAINS  MSTRDR,  SUBLIST  ,NUL  1ST, 

REACS  THE  DATA  FILE  NUMBER  OF  VARIABLES,  # 
ELEMENTS/VARIABLE,  DATA  ELEMENTS.  INCLUDES 
VAR'S  CF  TEST  INTO  LIST,  EXCLUCED  PAIRS  OR 
BLOCKS  AS  REQ.  YIELDS  LlSTtDATA  ELMS), 
LPTR(PCINTER  FOR  LIST),  N TREA T ( A  OF  VAR  IN 
TEST!,  TOTELM  {#  ELMS  IN  LIST) 

CONTAINS  GENERAL  SUPPORT  SUBROUTINE  S/FUNCTI  ONS 
SUBPQUT  INES:  E XPF ,  F INDS,F I  NDT ,  MUVA R  ,NORMF , NORM  V , 
P  AFCOM  ,RANCUE  , RHOER,  RNQ, R  PARAM 
CONTAINS  THE  24  NONPARAMETR I C  TESTS:  USE  EITHER 
EITHER  LIST  OR  RCCT:  COMPUTE  TEST  STATISTICS 
OR  LEVEL  OF  SIGNIFICANCE:  WRI  TES  TC  OUTPUT 
FILE 

THc  ORCER  IN  THE  PROGRAM  IS  THE  SAME  AS  THE 
NUMBER  OF  THE  TEST,  NTEST 


NLM8EP  1-6  INDICATING  Fl  Lt  TEST  A-F  FCR  OATA 
INT  V  ECTOR!  1 6  )  GENERAL  PURPOSE  HOLDER 
INT  NLMBEP  OF  VARS  OR  TREATMENT  IN  DATA  SET 
INT  NLMBER  OF  VAR/TREATS  TEST  ACTUALLY  USES 
INT  V  ECTOR (1 6 )  INDICES  OF  TREATS  CR  VARS 
ACTUALLY  USE  0 

INT  VECT0RU6)  #  ELEMENTS  OF  CATA  IN  EACH 
VAR/TREAT;  IF  SOME  BLOCKS  EXCLUDED, 
NUMBER  REOUCED 

INT  TCTAL  »  OF  DATA  ELEMENTS  IN  DATA  SET: 

IF  SOME  BLOCKS  EXCLUDED,  NUMBER  REDUCED 
INT  INOEX  OF  NON-PARAMETRIC  TEST  TO  BE  USED 
INT  V  ECTQR12  00)  INDICES  OF  BLOCKS  TO  BE 
EXCLUDEDiMAY  BE  EMPTY 
INT  NLMBER  OF  ELEMENTS  OF  EXB 
INT  V  ECTQR{5  0  )  INTEGER  PARAMETERS 
INTEGER  VAR  FOR  CELLER:  MAYBE  NRCW  OR  SOME 
CTHER  VALUE  DEPENDING  ON  TYPE  OF  CELL 
CEFIMTICN  (SEE  TYPCEL) 

INT  NLMBER  OF  ELEMENTS  IN  IP 
REAL  VECTCRl 10)  REAL  PARAMETERS 
INT  NLMBER  OF  ELEMENTS  IN  RP 
INPUT  FILE  INDEX 
CUTPUT  FILE  INDEX 
REAL  YEC  TCR(  400  )  CATA  ELEMENTS 
INT  VECT0RU7)  POINTS  TO  THE  FIRST  ELEMENT 
OF  EACH  VAR  OR  TREATMENT'S  DATA  IN  LIST; 

THE  LAST  ELEMENT  OF  LP TR*T CTELM+i 
REDUCED  NUMBER  OF  VARIABLES 
VECTOR  OF  REDUCED  #  CF  ELEMENTS  PER  VARIABLE 
*-l  IF  DICOTOMIZE  BY  PARTITION,  ELS£*NUMBER 
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c 

c 

c 

c 

c 

c 

c 

c 

c 

c 

g 

c 

c 

c 

c 

c 

c 

c 

c 

c-~ - 


DI  VAL 
NSUC  C 
CELLS 
7YPCEL 


RCCT 

I 

ROwNR 

COLNR 

RORC 

NTORO 

CIV 

CRP 

TIEV 


CF  VALUES  OF  SMA4.LE 
C  IF  DATA  NOT  OICOT 
REAL  PARTITION  VALUE  OR 
IF  01  COTOMIZ  ED  DATA,  HOW 
INT  *1  IF  TEST  USES  CELL 
INTEGER  METHOD  OF  OEFINI 

1- 0  I  COT  PARTITION, 

2- 0  I  COT  LI  ST  ,3-R*C 


S  OF  SMALLER  SUBSET  CF  DATA, 

A  NOT  0 ICOTQMIZ  EO 
N  VALUE  OR  LIST  CF  SUBSET  VALJES 
D  OATA,  HOW  MANY  OF  CNE  SUBSET 
T  USES  CELL  COUNTS,  ELSE  =0 
0  OF  OEFINI NG  CELLS: 

PARTITION, 

LI  ST  ,3-R*C  PARTITION, 


LiOltJ” r  t  1  I  1UI,  , 

4-FIXED  ROW/COL 

COUNT,  5-ENUMERATED  CELL  COUNT 
RCW ,CCL  CELL  COUNT.  BY  RCWS  (RE  FIRST  VAR), 
G.  .ALL  COLS  OF  ROW  1,  THEN  ALL  COLS  OF  ROW  2 
VECTOR  OF  ROWS  TO  WHICH  1ST  VAR  ELEMS  BELONG 
VECTOF  OF  COLS  TC  WHICH  20  VAR  ELEMS  EELGNG 
INTEGER  LOGICAL  1*R0W  0*COL 
INTEGER  #  OF  VALUES  TO  REAO  (W/I  CELL ERJ 
INTEGER  VECTOR  CELL  INPUT  VALUES 
REAL  VECTOR  CELL  INPUT  VALUES 
INTEGER  VECTOR  OF  T  I  ES  :T  IEV  ( 1 )  .  L E ,T I  EV (2  )  .GT 
BOTH  WITHIN  EPSILN  OF  PARTITION  VALUE 


a 


INTEGER  NIFtNFILE  «IHQLD(  16)  ,  NVAR S , TO  TELM  , NTEST , 

♦  NTRE  AT, I P  ( 50 )  ,NRP,LP TR( 17 ),N ELMS ( 16) ,RN£LMS( 16), 

♦  DICOP,NSUCC»CELLS»NOF»CIV(50)»NT  CRD,  RCkC  ,  ROW  NR  (50  ) , 

♦  RCCT  (50)  »TVPCEL»TIEV(2)  «EXB(  200)  ,  TREAT)  16  ),  RN  VARS , 
♦REDE LM, CCLNR ( 50) 

REAL  RP(5C),LIST( 400 ) ,0 1 VAL ( 200 )  ,CRP(50) 

FORMAT)  16  15) 

FORM AT( 3  F1D.3  ) 

FORM AT(  IX, 161  5) 

F0RMAT(1X,8F1C.3  ) 

REAO(lOflCl)  NIF.NOF 
REAO  (  10 , 10 1)  NTEST, CELLS 
WRIT  E(9 , 2G1 )  NIF,NOF .NTEST 
TYPC  EL*  0 

IF  (  CELLS. EQ.l)  CALL  CELLER( NIF, NOF, NTEST, NROW.NCOL, 

♦  R  CCT,  LIST,  LPT  R,NVARS,NTRE  AT,  N  ELMS,  RP,  IP, 

♦  TREAT, TIEV  .TYPCEL) 

IF  (CELLS. EC. C)  CALL  MSTRDRl  NVARS, NELMS,  LIST, 

♦  LPTR  ,  TOT  ELM,  N  TREAT  ,N  IP,  IP,  NRP,RP  ,NI  F  ,  NOF  ,  TREAT ) 
IFUTYPCEL.NE.62J  .AND. (TYPCEL. NE. 72)  ) 

♦  CALL  VNAME(NIF, NOF, NVARS, NTREAT, TREAT) 

IF (N TEST • EC. 1 )  CALL  BINTST 

♦  ( NOF , RCC  T,RP , IP , ALPHAT ,C ILL, C  IUL ) 

IF (N TEST • EO. 2  )  CALL  QUANTU  NCF,  R  CCT,  RP,  I  P  ,ALP  FmT  , 

♦  Cl LL, C IUL, L  1ST, LPT R,T IEV, TYPCEL) 

IF(N  TEST .EQ.3  )  CALL  COXSTUt L 1ST , LPTR , IP, RP.NOF.NTESl 


IF (N  TEST • EQ.3  )  CALL  COXSTUt L 1ST , LPTR , IP ,RP, NOF, NTEST ) 
IF  (NTEST •  EC.4  )  CALL  MANWHTt L PTR , LIST ,NGF , RP ) 

IF  (N  TEST  •  EQ.  5  )  CALL  SGNT (  LI  S T,LP  TR  ,NQF .TOTELM  ,RP  ,  IP , 

♦  NT  EST  ) 

IFiNTEST .EC. 6  )  CALL  MCNEMR( RCCT , RP , I P , NOF ) 

IF(NTEST  .EQ.7  I  CALL  RCONT (RCCT, NROW ,  NCOL  , NOF  ) 
IFCNTEST.EQ.8  )  CALL  GOF IT  (L I  ST, L  PTR,  IP ,RP  , NO F  ) 

IF  (NTEST  •  EC.  9  )  CALL  WILCOX(  L  I  ST ,  LPTR  ,  IP ,  RP,  NOF) 

IF  (NTEST  •  EC.  1  C )  CALL  CORREL 

♦  (LIST  ,  LPTR, NTREAT  ,TREAT,NCF,  IP  ,RP) 
IF(NTEST.EQ.11)CALL  REGRET)  L I  ST,  LPTR  ,  IP  ,  RP,  TREAT  , NOF  ) 
IF (N TEST  »EC.12)CALL  MCNO FE( L 1ST, LPTR ,  IP , RP, TR EAT ,NOF ) 
IF  (NTEST  .EG.  15)CALL  MED  I A  (NTREAT  ,  LI  Si,  LPTR, NIF, IP  ,N0  F) 
IF(NTEST  .EQ.l^JCALL  KRUWAL 

♦  (LI  ST,  LPTR,  IP, RP, NOF, NTREAT  .TREAT) 
IF(NTEST.EQ.15)CALL  SCUARK 

♦  (LI  ST,  LPTR, NTREAT, TREAT  ,  IP, RF, NOF) 
IF(NTEST.EC.16)CALL  HARTLE 

♦  <Ll ST, LPTR, NTRE  AT .TREAT ,  IP , RF»NOF ) 
IFINTEST  «EG.17)CALL  FRIEDM  . 

♦  (NTREAT,  LPTR,  LI  ST,IP,RP  ,  NOF,  TOT  ELM) 
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IF(NTEST.£G.1£)CALL 


IF(NTEST 
IF  IN  TEST 
IF(NTEST 
IF(N  TEST 
IF  (N  TEST 
IF(NTEST 
CALL  REM 
STOP 
END 


.EC.1S)CALL 

•  EC*  2  Cl  CALL 

•  EQ.21ICALL 
.Ei.22)CALL 
.  E3.22JCALL 

•  EC • 2  A )CALL 
INQ(NCF) 


Cl  LADE 

(NTREAT,LPTR,LIST,NCF,TCTELM) 
CCCHRN(NROW,  NCOL  ,RCCT,IP  ,RP,NQF) 
KQLMCGi L I  ST  f  LPTR , TRc A 7, I P  ,RP ,NGF ) 
LILIEF!  L  1ST,  LPTR  .TREAT,  IP  ,RP,NJF  ) 
WiLKSH!  LIST'  LPTR,  TREAT,  I  P,RP,NJF  I 
SMIRN2I  LIST, LPTR,  TREAT, IP  ,RP  ,NOF  ) 
CRVONMI L  1ST ,  LPTR  ,  TREAT,  IP  ,RP  ,NOF  ) 


SUBROUTINE  REMIND(NOF) 
INTEGER  NCF 
WRIT  E (8 , 1  Cl ) 

FORM  AT!  1 X  »  '  TFE  RESULTS  ARE 
IF  (NOF.EC.ll  )  WRITE  (S,  11) 


LISTED 


FILE'  ) 


FORMAT!  IX,' 

IF  (N0F.EC.12  I 
FORMAT!  IX,' 

IF  (N0F.EC.i3) 
FORMAT!  IX,  • 

IF  < NCF. EC. 14  I 
FORM  AT  (  IX,  • 

IF  !N0F.EC.15  ) 
FORMAT!  IX,' 


U  LISTING') 
WRITE  (8,  12) 

V  LISTING') 
WRITE (8, 13) 
W  LISTING') 
WRITE (8,14) 
X  LISTING'  » 
WRITE  (8,  15) 

Y  LISTING') 


(N0F.EC.16  )  WRITE  (8,  16) 


FORMAT!  IX , ' 

RETURN 

END 


LISTING* ) 


SUBROUTINE  VNAME  <  NIF  ,NOF  »NVARS»NTRcAT  , TREAT) 

C  THIS  READS  EACH  VARIA8LE  NAME,  AND  IF  THE  VARIABLE 
C  IS  IN  THE  TEST,  WRITE  THE  NAME  INTO  THE  OUTPUT  FILE 
INTEGER  NIF  , NCF,  NTRE  AT,  TREAT!  16)  ,NVARS,I  ,J 
j*i 

IF  !  NVARS.5Q.  NTRE  AT)  WRITE!  NCF, 402  I  NVARS 

402  FORM  AT!  IX,  'ALL',  15, 

*  •  VARIABLES  CF  DATA  SET  USED  IN  THIS  TEST') 

IF  (  NVARS. NE.NTREAT)  WRITE!  NCF, 403)  NVARS, NTREAT 

403  FORMAT!  IX, 'OF', 15,'  VARIABLES  IN  DATA  SET  ,'  ,  /  ,5X  ,15, 

*  »  LSEO  IN  THIS  TEST') 

00  10  1  =  1,  NVARS 

REAO  (NIF, 200 
IF  (I.NE.TREAT(J))  GC  TO  10 
WRITE(NCF,4C1)  TREAT(J) 

401  FORMAT  (IX,'  THE  NAME  OF  VARIAB LE/TREATMcNT  ,  14) 

J*  J+l 

WRITE! NCF,2C0l 
10  CONTINUE 
200  FORMAT  (IX, 

*50HAAAABB£BCCCCDDCDE  EEEFFFFGGGGOOOOI I II J  J  JJKKKKLLLLMM) 
99  CONTINUE 
RETURN 
END 


GC  TO  10 
AT(  J) 

ME  OF  VARIABLE/TREATMENT  *',I4) 


EEEFFFFGGGGOOOOI  I II J  J  JJKKKKLLLLMM) 


SUBROUTINE  FNAME  (RORW,IF,OF) 

C  THIS  REAOS  THE  FILE  NAME  FROM  FILE  "IF", 
C  ANO  WRITES  THE  FILENAME  TO  OF  IF  RORW  IS  2 
INTEGER  RCRW.IF.OF 
IF  (  RCRW.GT.l  )  GO  TO  10 
REAO  (  IF, 200 
GC  TO  95 

10  WRITE  (CF  ,122  ) 

122  FORMAT  (  ICX,  '  NAME  OF  DATA  SET:') 

WRITE  (OF, 200  ) 

200  FORMAT  (  IX  , 


IF  RORW 


160 


nnonnnonnoonononnnnonnooonnnnnonnnoonnnnnoo 


V.  ■ 

V.' 


♦  48H  AAAABEBBCCCC DDDQ  EEEEFFFFGGGG0QQQ1 111  J JJ JKKKKLLLL > 
99  CONTINUE 

RETURN 

END 

Q**************  4* **4****** *4* ******************«*?***# #****« 

SUBROUTINE  CE  LLER  (NI  F,NQF,NTEST,  NROW  ,NCOL  ,RCCT, 

♦  LIST  ,  LPT  R,NVAFS,  N  TREAT,  NELMS  ,RP,  IP,  TREAT  ,  TI  EV  »TY  PCEL  ) 
THIS  SUBPROGRAM  CROUP  FINDS  THE  CELL  COUNTS  FOR  CNt  QF 

FIVE  NCNPARAMETRI  C  TEST — BINOMIAL,  QUANTILE,  CGCHARAN, 
MCNEMAR,  RC  CCNTINGECY  (  CHI  SQAURE  TEST  FOR  I NDEP I  —  AND 
PLACES  THOSE  CELL  COUNTS  INTO  THE  VECTOR  RCCT,  WHICH  JS 
DIMENSIONED  BY  NRCW  AND  NCOL.  UNLESS  THE  USER  HAS 
ENTERED  THE  CELL  COUNTS,  THE  SUBROUTINE  CALLS  SUBROUTINE 
MSTRCR  TO  R E AC  THE  DATA  SET  INDICATED  BY  NIF.  THE  DATA 
VALUES  ARE  IN  LIST,  WHICH  IS  DIMENSIONED  BY  LPTR. 
SUBROUTINES: 

DICOT  PERFORMS  THE  CICQTOMI ZAT  ION  FOR  THE  BINOMIAL  AND 
QUANTILE,  GETTING  #  SUCCES  S=RCCT(1)  AND  *  FAIL 
=RCCT(2  J  T  YPCE  L*2  IF  BY  LIST,  *1  IF  BY  PARTVAL 
COCH  GETS  THE  BLOCK  ANO  TREATMENT  SUMS  OF  SUCCESSES 
BY  PARTVAL!  BLOCK  SUCCESSES  INTO  RCCT(  I 
RCCT(B);  TREAT  SUCCESS  INTO  RCCT(B  +  ll  ,  .  ••  , 
RCCT(S*7);  TYP  CEL* 

MCOUNT  GETS  THE  FOUR  CELL  COUNTS  FOR  THE  MCNEMAR  TEST. 

RCCT(l)  IS  SUCCESS/SUCCESS  ,  RCCT(2)  SUCCESS/ 
FAILURE,  ETC.  TYPCEL* 

OIVICES  PAIRS  INTO  CELLS  OF  AN  RC  CONTINGENCY 
TABLE,  BASED  ON  PARTITION  VALUES.  FCCT<1) 

IS  FIRST  ROW  FIRST  COL,  RCCT<  2 )  IS  FIRST  ROW 
SECOND  COL,  ETC.  TYPCEL*3.  CALLS  PAIRST, 

TC2LN,  AND  RCC  TR 

SORTS  THE  VALUES  OF  THE  TWO  VARIABLES,  ANC  KEEPS 
AN  INDE >  KEY  ON  EACH,  IN  GRDER  TC  RE-PAIR 
THE  PAIRS;  KEY 1*RQW »  KEY2*COL 
CALLED  BY  TC2 •  ASSIGNS  AN  INTEGER  CORRESPONDING 
TO  THE  FOW  IN  WHICH  AN  ELEMENT  CF  THE  FIRST 
VARIABLE  FALLS,  OR  THE  COL  IN  WHICH  AN  ELEMENT 
OF  SECGNO  VARIABLE  FALLS,  BASEC  ON  PARTITION 
VALLES  CONTAINED  IN  VECTOR  CRV.  INTEGER  VECTOR 
RORCNR  CONTAINS  THE  INDICES  OF  ROW  CR  COL 
ASSIGNEC  FOR  EACH  VALUE 

RCCTR  TAKES  THE  RORCNR  VECTORS  FOUND  BY  TC2LN  OR  TC4LN 
ANC  FORMS  CELL  COUNTS 

TC4  DIVIDE  THE  OATA  PAIRS  INTO  THE  CELLS  CF  AN  RC 
CONTINGENCY  TABLE  BASED  ON  FIXED  ROW  AND  COL 
TOTALS.  TY PCE L*4.  CALLS  PAIRST,  TC4LN,  ANO 
RCCTR 

TC4LN  ASSIGNS  A  VALUE  TO  A  ROW  OR  COLUMN,  BASED  ON 

FIXED  RCW  OR  CCL  TOTALS;  THE  VECTOR  OF  ASSIGN 
IS  PCRCNR 

INTEGER  M  F,N  FILE,  IHCLDi  161  ,  NVAR  S,  TOTELM  ,NTEST» 

♦  NTREAT,  I  P  (  50)  ,NRP,LPTR(  171 ,  NELMS  (161  ,RNELMS(  161  , 

♦  DICOP, NSUCC,  CELLS, NO  F,C  IV  (50  i,NTORD,RCRC  ,ROWNR<50)  , 

♦  RCCT  (50  »  ,TYPCEL.KEY1  ,KEY2,K,KK,I  ,  J ,  J  J  ,  IP  1  ,NROW,  NCOL, 

♦  EXB(  200  J  ,  TREAT!  16  I  ,RNVAR  S,R£OELM  ,CQLNfi(  50  I ,  T  ,  LL  ,UL, 

♦  TI EV  <21 

REAL  RP(  5CI  ,  LIST  (400  J  ,DIVAL(2C0)  ,CRV  (50)  ,VAL1  (200), 

♦  VAL2  (200  ) 

READ  ( 10 , 1CI )  TYPCEL,  I  PI  ,  NCOL  ,NTORD 

101  FORMAT!  1615) 

102  FQRMAT(3F10.3  ) 

201  FCRMAT(iX,16I5) 

202  FORM  AT(  1X,8F1C.3) 

NROW  *1 

IF  (  TYPCEL. GT. 2)  NR0W*IP1 
WRITE  (  9,209)  TYPCEL  , NROW, NCOL, NTORD 
209  FORM  AT(  1  X  » 1  TY  FECE  L  ,NPCW  ,NCOL  ,NT  GRD,  '  ,415  ) 


TC3 


PAIRST 


TC3LN 


161 


ooooonnooon 


10 


12 

14 


NTREAT*NC0L 

NVARS*NC0L 


•  INC  ICES  ' ) 


*  INOICES') 

.1-1. 

.1-1. 


NEXB ) 
NEXB  ) 


T*TY  PCEL 

IF  (  (T.NE.5).AN0.(T.NE.61).AND.  <  T.NE.62)  .AND. 

*  (T.NE.71  )  ,ANC.(T.NE.72)  )  GO  TO  10 

REAQ(1C,1Q1)  (RCCT(I)  ,1*1  ,NTQRD) 

IF  (  (  T.EQ.62)  .OR.  <  T.EG.72.M 

*  CALL  FhAME  (1,NIF, NCF ) 

LIST! 11*0. C 
L  PTR ( 1  )*1 
NVARS*IP1 
NTREAT*IP1 
I  F IT . EC  .71  ) 

I  FIT. EC. 71  ) 

DO  5  KK*1 y  MRE AT 

TREAT(KK)*KK 
5  CCNTINLE 

NELMS  (11*0 
NTREAT  *NCCl 
READ  ( 10 , 1C  1)  NTRETE 
WRITE  (9t401J  NTRETE 

401  FORMAT!  IX, 'NUMBER  OF  TREATMENTS*  .15  , 

RE  AD  (10, 101)  (TREAT!  I  ) ,  I -1,  NTRET  E  ) 

WRITE  (9, 201)  (TREAT (  1 1 ,  1-1, NTRET E ) 

READ ( 10 , 1  Cl )  NEXB 
WRITE  (9,202)  NEXB 

302  FORM  AT(  IX  • 'NUMBER  OF  EX  BLOCKS'  ,  15' 

IF  (  NEX8.6C.  1  )  REAO(  10,101)  (EXB(l) 

IF  ( NEXB.Ct.l  )  WR  ITE ( 9, 201)  (EXB(l) 

CALL  RFARA  MNI  P»N  RP,  I P  ,RP  ) 

IF  (  (  T.NE.42)  .AND.(T.NE.72.)  ) 

*  CALL  FNAME!  1.10.N0F) 

CALL  FNAME  (2,9  ,9) 

CALL  FNAME  (2, 1 0,N  QF ) 

IF  (  (TY?CEL.GT.3).AN0.(  TYPCEL.LT.6)  )  GO  TC  12 
READ! 10,102)  ( CRV II ) , 1*1 , NTOR D ) 

W  RITE  ( 9  ,20  2 )  (CRV(I)  ,I*1,NT0RD) 

GO  TO  14 

READ  (10,101)  (CIV(I)  ,I*1,NT0RD) 

*WRITE(9, 2C1)  tCIV(I)  ,I  =  1,NT0RD) 

CALL  MS TRCR (N VARS, NELMS, LIST, LPT R.TQTELM, NTREAT, NIP, 

*  IP  ,NRP,RP,NIF, NCF, TREAT) 

IF  (T  YPCEL.LE.  2)  CALL  01  CCT(  NOF ,  N  TORD  , TYP CEL  ,  I  FI  ,CRV , 

*  L 1ST, LFTR, TOT  ELM,  RCCT,TIEV) 

IF  (T YPCEL. EG.  2)  CALL  TC3 (  NTR EAT  ,  LIST  , LPT  R  ,CRV  ,NT uRD, 

*  IP1  ,NCOL,  PCCT  ) 

IF (T YPCEL  •  EQ  •  4 )  CALL  TC4( NTR EAT , LIST , LPTR ,CI V , 

*  NTORC.IP  1,NCGL,RCCT) 

IF(TYPCEL.EQ.63)  CALL  MCGUNT  (LIS  T,  LP  TR  ,C  R  V,  RCCT , 

*  TREAT) 

IF(TYPCEL.EQ.  13)  CALLCGCH 

*  (NTREAT  ,NGF,  LIST,  LPTR,  RP  ,NTORD,  CRV ,  NRGw  ,  NC3L  ,RCCT) 
IF  (T  YPCEL.  EQ.  <3)  GO  TO  99 
WRIT  6(9,  |C1 ) 

WRIT  E  (Nu£  ,301  ) 

FORMATdX,*  CELL  MATRIX;') 

CO  80  1*1  ,NRCW 

L  L*l  +  (  1-1 )  *NCCL 
UL*I ♦NCCL 

WRITE  (9,201)  (RCCT(J),J*LL,UL) 

WRITE  (NOF  ,201  I  ( RCCT < J) , J=LL  ,UL) 

CONTINUE 
RETURN 

ENO_  _ 

”  SL8R0UTI  Ni"*COCH 

*  (NTREAT  , NCF  ,  LIST, LPTR,  RP»NTORD»  CRV.NROW  ,NCOL  ,RCCT) 

C  -  HIS  FINDS  WHICH  OATA  ELEMENTS  ARE  SUCCESSES, 


301 


80 

99 


16  2 


noon 


rsy 


BY  COMPARING  LIST  WITH  PT,  COUNTS  THE  #  OF  SUCCESSES 
PER  ROW  £  COL,  SENDS  TOTS  INTO  RCCT,  INDEX  l-NROw  = 
SUCCESS  PER  RCW(  BLOCK)  ,NROW+l  TO  NROW+NCGL=SUCCESS 
PER  CCL ( TREAT ) 

REAL  LI  ST(400  ),RP<50)  ,CRV<50)  ,PT  ,MULT,V 
INTEGER  NTRE AT,L  PTR( 17 ) , NROW ,NCO L, RCCT< 5 0  * , N , IC , IR, 

*  LL  »U  L  »NT 

N  =  IFIX<FLCAT(NTQRO)/2) 

IF  < N.NE.NTRE4T)  WRITE  (NQF,101)  N , NTREAT 
101  FORMAT!  IX, 

*  •  EXPECT  EC  NUPBER  OF  TREATMENTS  BASES  ON  NUMBER', 

*  /'OF  PARTITICN  VALUES  WAS', 14, 

*  •  WHICH  DOES  NOT  AGREE  WITH  THE', 

*  /'LISTED  NUMBER  CF  TREATMENTS, ',  14) 

NT=NROW+NCCL 

DO  5  1*1, NT 
RCCT (  I  )=0 
5  CONTINUE 

CO  10  1*1,  N 

MULT *Ch V ( <  2*1  I  ) 

P  T=C  R  V  ( (2*I)-1  )*MULT 
I C*I +NRCW 
LL«LPTF(I) 

UL*LPTR  (I*  11  —  1 
00  20  J«LL,UL 
IR*J-LL+1 
V3LI S  T ( J ) *MU  LT 
IF(V.GT.PT)  GO  TO  20 

FCCT  (IR  )«RCCT ( I R)  +  l 
FCCT(  IC  )  *RCCT(  I  C  )  +  l 

20  CONTINLE 

10  CONTINUE 

NRQWPl  =  NRC>-3+l 

WRITE  (  NCF  ,20  1)  (  RCCT  (I  )  ,1*1  ,  NROW  I 
WRITE  (NCF, 202)  (  RCC  T  ( J )  ,  J«NROWP  1  ,NT  ) 

201  FORM  ATdX, 'ROW  TOTAL  S  :•  /20I3  ) 

202  FORMAT!  IX, 'COLUMN  TOTALS  :'/20I3  ) 

RETURN 

END 


SUBROUTINE  TC 2 
*  (NTREAT, LIST, LPTR,C 
INTEGER  NTREAT, LPTR( 
*KEY2  <200  ),NPA  IR,  RGWN 
REAL  LI  ST (400  )  ,V  AL1 ( 
WRIT  E ( 9 , 1C1) 
FORMATdX,  •  INTO  TC3' 
CALL  PAI RST ( VAL1 , VAL 
CALL  TC3LN ( 1 , CRV , VAL 
CALL  TC3LN  (0,  CRV,  VAL 
CALL  RCC  TR (NRCW, NCOL 
RETURN 
END 


RV,NTORD,  NROW,  NCOL,  RCCT) 

17) ,NTORD, NRCW .RCCT (50) ,CIV(50)  , 
R  (50  )  ,CQLNR  (  50) ,  NCOL  ,  KE  Y1  (200) 
200) ,VAL2(200),CRV150) 

) 

2,<EY1,KEY2, LIST , LPT R  ,NP AIR) 

1  ,KEY1,N PAI  R  , NROW  ,NCCL,ROWNR) 
2,KEY2,NPAIR  , NROW , NCOL,  COLNR  ) 

, NPAIR, RCWNR , COLNR, RCCT) 


C  SUBROUTINE  TC4 


RETURN 

END  ^  __ 

SUBROUTINE  PA  IRS T 


163 


*  <VAL1,VAL2,K 
INTEGER  N7REA 

*  KEY2  (200  )  »NP  A 

*  NRCE  LM,  N  TCP » I 
REAL  LIST  (400 
WRITE  49 , 101) 
FORM  AT( 1 X  ,  '  IN 
NPAI R=LPTR (2 ) 
CO  10  1*1,  NPA 

J*I+NPAIR 
VAL1(  I  )*LI 
VAL2(  I  )*LI 
KEY1  l  I  )  =  I 
K EY2 (  I  )  =  I 
CONTINUE 
WRITE  (9  ,305) 
FORMAT  (IX,' 
WRITE(9 ,301) 
WRIT  E (9  »  301 ) 
CALL  SHSCRTIV 
CALL  SHSCFT (V 
WRIT£(9,3C2) 
FORM  AT  (  IX, « 
WRIT  E (9 , 301 ) 
WRIT  E  ( 9 , 3 Cl ) 
FORM  AT( 7  F  1C.2 


EY1  ,KEY2,LIST,LPTR,  NPAIR) 

T,LPTRl  17)  , NTO RD ,NR QW , NCCL »  RCCT  (  £0)  , 
1R,  RCWN  R  (50  )  ,COLNR(  50),  CIV  (  50  , 
CIV,ICRV,NIN,I  »J,JJ»K,RCPC,KEY1  (200) 
) ,  V  AL1(  200),  VAL  2(200)  ,CRV(50) 


C 

Cl  01 


RETURN 
END_  __ 

SUBROUTINE  TC 

*  (RGRC.CRV , VA 
INTEGER  NTREA 

*KEY2 (2001 ,NPA 
♦NRCELMtNTCP, I 
*KEY1  (200  )  .KEY 
REAL  LISTI4C0 
WRITE  (9,101) 
FORM  AT<  IX  » '  I 
ir*i 

IKRCRC.LT. 1 ) 
I CRV* 1 
K*0 

NTCP*NFCW- 
GC  TO  2C 
CONT  INUE 

I CRV*NFCW 
K  *NRQ  W  - 1 
NTOP=NFGW+ 
CONTINUE 
WRITE  (  9 , 2C3  ) 
FORM  AT(1X,'  IK 

*  '  VAL (  K )  ,CR 
IF  (  (ICRV.GT. 

*  GC  TO  12 

I CRV*  ICRV+ 
GO  TO  20 
RORC  NR( KEY (IK 
WRITE  (9.2G4) 
FORMAT  (IX, *K 
I  K*I  K  +  1 
I F  ( IK.GT. 
GO  TO  20 
RETURN 
END 


TO  PAIRST*  ) 
-LPTR  (1  ) 

IR 

ST  C  I  ) 

5T(  J ) 


VALUES  OF  VAL1 ,  VAL2 
(VALKI  ), 1=1, NPAIR) 
( VAL2  ( I  )  ,1=1,  NPAIR) 
AL1 ,KEY1, NPAIR) 

AL2  , KEY  2, NPAIR) 

VALUES  OF  VAL1 ,  VAL2 
(VALKI  ),  1  =  1, NPAIR) 
( VA L 2 ( I  )  ,1=1  , NPAIR) 
) 


BEFCPE  S CRT 


AFTER  SORT 


L,KEY,NPAIR,NROW,NCQL,RGRCNR  ) 

T,LPTR(  17)  ,NTOROtNRQW,NCCL,RCCT(  50)  , 
IR.RCWN  R  (50  )  ,COLNR(  50 )  ,  C  IV  (  50)  ,  I  K, 

CIV,  ICRV,NIN,I  ,  J,JJ  ,K ,  RCRC ,  RCRCNF(  50)  , 
(200  ) 

) ,  VAL(2  00)  ,CRV (  50) 

NPAIR 

NTO  TC3LN  , NPAIR:'  ,  15) 


NCOL-2 

IK , ICR  V  ,K , NTOP ,VAL (IK) ,CRV (  ICRV  ) 
,ICRV,K,NTOP:' ,415, 

V(ICRV)  '  ,2 F10.3  ) 

NTOP  ).0R.(  VAL(  IK).LE.CRV(ICRV)  )  ) 


) )= I CRV -K 

KEY(IK),  RCRCNR  (KE  Y  ( I K  )  ) 

EY(  IK)'  ,  15,  '  RORCNR  (KE  Y  (  IK )  )  ' ,  I  5  ) 

NPAIR)  GO  TC  99 


TS7 


INTEGER  NTREAT,LPTR(  17)  ,  NTORD,NRGk,NCCL .  RCCT1  50)  , 
*KEY2  ( 200  )  ,NPAIR,  RCWNR(50  )  ,C0LNR(  50  ) ,  C IV C  50)  » C  ELLnR  , 
*NRCELM,NTCF,ICIV  ,  ICRV  ,NIN,I  ,  J, J J  ,K , KEY1 ( 2C0 ) 

REAL  LI  ST  (400  ),VAL1(  2  00  )  ,  VA).2( 20  0)  ,CR  V(  50  ) 

WRITE  (9,101) 

101  FORMAT!  IX,  •  INTO  RCCTR') 

NRCELM=NRCw*NCOL 
DO  10  1=1  ,NRC  ELM 
RCCT ( I ) =0 
10  CONTINUE 

CO  2  0  1=1  ,NPA  IR 

C  ELLNR=  (ROWNR  ( 1 1-  1  )*NCOL+CQLN R(  I ) 

RCCT  (  CcLLNF  )=RCCT  (CELLNRJ+l 
20  CONTINUE 
RETURN 
END 

SUBROUTINE  TC4Ln7rOR  C  ,C  I V  ,KE  Y,NP  AIR,  NROW  ,  NCOL  ,RORCNR  ) 
INTEGER  NTREAT,LPTR(  17)  , NTORD,NRCW , NCCL,  RCCT  (50), 
*KEY2  (  200  ),NPAIR,RCWNR(50I  ,COLNR(  50)  ,CIV(  50  , 
*NRCELM,NTCP,  I  CIV,  ICR  V  ,N  I  N ,  I ,  J,  J  J  ,K,  RCRC,  RCRCNR(50), 
*KEY1 (200) ,KEY (200) 

PEAL  LI  ST  (400  1,VAL1(  200 ),  VAL  2(  20  0  )  ,CRV(  50  ) 

NRCE  LM=NRCW*NCOL 
IF  (  RORC.LT.l)  GO  TO  10 
NTCP=NFCW 
ICIV=1 
K.  =0 

GC  TO  20 

10  NTOP=NROW+NCOl 
I CIV=NPCW+1 
K  *NRO  W 

20  J=0 

CO  30  1=  IC  IV ,  NTOP 
N IN=C I V ( I J 

IF  (NIN.EQ.Q)  GO  TO  30 
DC  15  J J=1  ,NI N 

RCRCNF(KEY(  J ) )  =  I-K 
15  CONTINUE 

30  CONTINUE 
RETURN 
END 


c206 


SUBROUTINE  01 

*  TQTELM.FCCT 
INTEGER  CICOP 

*  NT  ORD, TYPCE 

*  KEY2(400),T 
REAL  DIVAU20 

*  EPSILN 
NSUCC-0 

T I  EV  ( 1 )  =  0 
TIEV(2)*0 
EPSI  LN*0.  CC01 
NPAI  R=LPTR(2) 
WRITE  (  9  ,206) 
FORMAT! IX, *CR 
•THIS  SECTION  I 
IF  (TYPCEL.EQ 
PT*CRV( i ) 
DC  10  1=1, 
IF  (L 
C  i  F*A 
IF  (D 


COTINOF  ,  NT  ORD,  TYPCEL,IP1  ,CRV  »LIST  ,LPTR, 
,TI  EV ) 

,LPTR(  1  7 ) ,  UL  ,1  ,  NSUC C, L,D, KEY  1(40  0)  , 
l.NPAIR  ,  IP1 ,RCCT(50 ) »NF AIL, I ,T1EW , 
CTELM 

C) ,  CRV(  200  ,LIST(TOTELM)  ,PT  , DIF,  ADI  F, 


-1 

(C  RV( I  1,1*1 ,NT  ORD) 

Y*,  7F10.3) 

F  DICOTQMIZE  BY  PARTITION— 

•2)  GO  TC  40 

npair 

1ST ( I).  GT.PT)  NSUCC  =NSUCC+1 
ES(  LIST  (  D-PT) 

IF.GT.EPSILN)  GO  TO  8 

IF  (  LIST  ( I  I  .LE.  PT)  TIEV(1)  =  TIEV(  1)  +  1 

IF  (LIST(I)  .GT.PT)  TIEV(2)*TIEV(  2)  + 1 


165 


8 

2  07 
10 


WRITE  (9,207  )  L  IST(  I  )»PT  , NSUCC 
FC3MAT11X  ,'LIST(  I),PT,NSUCC*  ,2F10.3, 15) 
CONTINUE 
GC  TO  99 

C - THE  FOLLOWING  SECTION  IF  DICOTQMI  ZE  8Y  LIST - 


40 


41 


42 


50 


GO  TO  9S 


CONT  INUE 
DO  41  1  =  1  ,NPA  IR 
K£Y1(  I)  =  I 
CONT  INUE 
CO  42  1  =  1  »NTQ  FD 
KEY2(  I  )  =  1 
0  I VAL  (  1  )=C  FV( I  ) 

CONTINUE 

CALL  SHS0PT1LIST  ,  KEY  1  »NPAIR ) 

CALL  SHSC FT (0  IVAL ,KE  Y2, NTQRD ) 

L=1 
D=1 

IF((  L.GT.NFAIF).OR.(  C.GT.NTQRO)  ) 

DIF=LIST(L  1-0  I  VAL  <D) 

ADIF=  AES(  0  IF) 

WRITE  (  5 « 2 C9 )  L  IS  T  ( L  )  ,DI V  AL(  D  )  ,0 1 F ,NS UCC 
FORMAT  (IX,  ’LIST, D  I  VAL,  01  F,NSUCC*  ,3F10.5, 15) 

IF  ( ACIF.G7.EPSILN )  GO  TO  60 
NSLCC=NSUCC+1 
L*  L+  1 
GO  TC  5C 

IF1DIF.GT.CI  D*D+ 1 
IF(OIF.LT.C)  L*L+  1 
GC  TO  50 

WRITE  ( 9 ,1C1 )  NSUCC, NPAIR 

FORM  AT(  IX  »  '  *  OF  STANDARD  SUCCESSES  1  ,15  , '  OF  1 
IF  (IPl.Et.O)  NSUCC* NPA IR-NSUCC 
C***REFLECT  THE  4  OF  SUCCESSES  IF  LIST  CONTAINS  FAILURES*** 


209 


60 


99 

101 


15) 


105 


15,'  OF 


15) 


OR  VALUES  ABOVE  PARTITION  ARE  FAILURES 
WRITE  (9,1051  NSUCC,  NPAIR 
WRITE  (NCF.lOf)  NSUCC, NPAIR 
FORM  AT(  1 X  ,  *  NLMBER  OF  SUCCESSES 
NFAI  L=NPAIR-NSUCC 
RCCT  (1)  =  NSUCC 
PCCT (2) =NFAIL 
RETURN 

END _  _ _  _  _ 

$U BR OUT 7 nI~ MC CUNT " ( L IStT L PT R7CRV 7RCCT , TR I AT ) 

C  THIS  GETS  CCUNTS  FOR  FOUR  CELLS  OF  THE  MCNEMAR  TEST: 
C  2  SUCCESS, 2  FAILURE,  SUC  CESS-FAI  LURE  , FAILURE-SUCCESS 
C  FBS=l.O  IF  SUCCESS  IN  FIRST  TREATMENT  IS  BELCW  CFV(l) 
C  =-1.0  IF  SUCCESS  IS  ABOVE  CRV(l) 

REAL  LI  ST  1400  I ,CR V <5 0 ) , FBS, S BS ,F PV ,S PV ,F ,S 
INTEGER  LF1RI  1 7)  ,  RCC  T  <4  )  ,N,  K  ,1  ,L  F  ,LS  ,  7RE  A 7(  16  ) 


N*LP  TR(  2  )  -1 
FBS=CRV(  3) 

SBS=CRV(  4) 

FPV*CRV(  1)*FBS 
SPV=CRV  (  2)*SBS 
LF=1  +  IF  I  X  (CRV  (5)  ) 
LS=2-IFIX(CRV<5)  ) 

CO  5  1*1,4 
RCCT (  I ) =0 
CONTINUE 
CO  10  1  =  1, N 
K  *1 

F  =LI ST ( I ) *FBS 
S  =  LI ST  ( I+N  )*S8S 
IF(F.GT.FPV)  K=K+LF 
I  F(S.GT.SPV)  K=K+LS 
RCCT(K)  =  RCCT(K)i-l 


10  CONTINUE 

IF  ( CRV(  5).LT.0.5)  GO  TO  20 
WRITE  (NCF.ill)  TREATU)  ,TREAT(2  ) 

GO  TC  30 

20  WRITE  (  NCF  ,11 1 )  TREA  T (2 )  ♦  TREAT  ( 1  ) 

111  F0RMAT(1X,'TREATEMENTS:  '/*  BEFORE', 15,'  AFTER',15) 

30  RETURN 
END 

£**************4*** 4****** ************ ******** ************ ** 
SUBROUTINE  MSTRDR(NVARS  ,NELMS,L I  ST, L PTR,  TCTELM, 

*  NTR  EAT  »  NIP  ,  I  F  ,NRP  ,R  P  ,NI  F  ,NGF,  TREAT) 

C  TFIS  SUBPROGRAM  GROUP  READS  ALL  OF  THE  DATA  FILE  EXCEPT 
C  THE  NAMES,  AND  LINES  4-7  CF  THE  CPTICNS  FILE.  IT  CHOOSES 
C  THE  DATA  ELEMENTS  OF  THOSE  VARIABLES  INCLUDED  IN  THE  TEST, 
C  AND  PUTS  THEM  INTO  THE  VECTOR  LIST.  IT  THEN  REMOVES  THE 
C  DATA  ELEMENTS  IF  THE  USER  INDICATED  TO  EXCLUCE  PAIRS  OR 
C  BLOCKS.  THE  INTEGER  VECTOR  LPTR  DIMENSIONS  LIST;  LPTR(l) 

C  =  1;  LPTR(I)  *  1ST  ELEM  CF  ITH  VAR  IN  LIST.  NTR  EAT  = 

C  *  TREATMENTS/VARS  IN  TEST;  TOTELM  IS  TOTAL  ELEMS  IN  LIST 
C  LPTR  (NTREAT  +  1 )  =  TOTELM+1.  NELMS  IS  INT  VECTCft  OF  ELMS 
C  PER  VAR  AFTER  BLOCKS/PAIRS  EXCLUDED;  TREAT  IS  INT  VECTOR 
C  OF  INDICES  OF  ORIGINAL  DATA  SET  WHICH  ARE  INCLUDED  IN  TEST 


=  1;  LPTR(I  )  *  1ST  ELEM  CF  ; 
it  TREATMENTS/VARS  IN  TEST;  ‘ 
LPTR  (NTREAT  +  1 )  =  TOT  ELM*  1  • 
PER  VAR  AFTER  BLOCKS/PAIRS  I 
OF  INDICES  OF  ORIGINAL  DATA 


I  S  INT  VECTOR 


PARAMS; 


IS  REAL  PARAMS 


SU8R0UT  INES 

MSTROR  READS  THE  FILES 

SUBLST  PREPARES  CATA  ELEMENTS  FOR  REDUCTION 
NULIST  INCLUCES  CNLY  TREAT  SPECIFIED  INTO  LIST 
EXBLCC  EXCLUDES  FAIRS  OR  BLOCKS  AS  REQ 

INTE  GER  N I  F  ,N  F  IL  E  ,  IH  OLD  ( 1 6)  ,  NVAR  S ,  TO  TELM  ,  EXB  (  200 )  , 

♦  NTREAT,  I  P  <50 1  ,LPTR  ( 1  7  ) ,  NELMS  ( 16  i  ,  RNELMS1  16i,RNVARS, 

♦  DICO  P, N  SUCC, C  ELLS, NT  EST. TREAT  (16)  »REOELM,  NRP 
REAL  RP(  1C),  LIST  (400  ),DIVAL(200) 

.01  F0RMAT(16I5) 

.02  FORMAT!  8F10.3) 

!01  FORM  AT (IX, 1615) 

102  FORM  AT<  1  X  ,  6F1  C.3  I 
CICOP=0 

CALL  FNAME  (1,NIF,N0F) 

CALL  FNAME  (2,N2F,9) 

CALL  FNAME  (2,NIF,N0F) 

READ (NIF  ,1011  (IHOLO! 1 1,1*1,16) 

NVARS=IHCLO(  1) 

LPTR  ( 1)  *  1 
TOTE  LM*  0 
CO  10  1*1 ,NVA  PS 
I  PI* I ♦ 1 

NELMS!  I  )*IFOLD( IP  1) 

L  PTR(  IP1)*LPTR(I)+NELMS(I ) 

T  GTE LM* TOT  ELM+NEL MS ( I ) 

.  0  CONT I NUE 

READ (NIF,  132)  (LI ST (I  > ,1*1, TOTELM  ) 

CALL  SU  B  LST(  LIST,  LPT  R.NVARS,  NTREAT, TOTELM, TREAT) 

CALL  RPARAM(N  IP, NRP,  IP, RP) 

WRITE  (9,407) 

,07  FORM  ATI  IX,*  LISTPTR  VECTOR  AND  LIST  AFTER  REDUCTION' 
WRIT  E(9, 201 )  (LPTR( I  )  ,1 *1 ,17 ) 

WRIT  E (9 , 202 )  (LIST(I  )  ,1  *1 , TOTELM  ) 

WRIT  E  (9  •  302) 

103  FORMAT  ( I  X,  '  N  VARS ,  TOTELM, NTE  ST,  N  TREAT  ,NEXB,NI  F,  NRP'  ) 
WRIT  E(9 , 2C1)  W VARS, TOTELM, NT EST , NTREAT ,NEX8, NIP, NRP 
RETURN 


LIST  AFTER  REDUCTION*  ) 


SUBROUTINE  SUELSTUI  ST,  LPTR ,  NVAR  S, NTREAT  ,  TOTE  LM,  TREA  T ) 
THIS  SUBROUTINE  READS  THE  VALUES  FOR  THE  INCLUDED  TREAT  St 
AND  EXCLUDED  BLOCKS,  IF  ANY,  ANO  REDUCES  THE  ORIGINAL 


C  DATA  SET  OF  TFE  DATA  FILE  INTO  THE  DATA  LIST  FOR  THE  TEST 
C  SUBROUTINE  NUlIST  INCLUDES  TREATMENT  Si  EXBLOC  EXCLUDES 


TREATMENTS 
REAL  LI  ST (4C0 ) 
INTEGER  LPTR  (  17 ) 


in  i  c  ijcr  iriM  in  *  NVA  RS»NTREAT*TQ  TELM  »TREAT(16),NcXB» 

*  RNELMSt  16  I  ,RN  VARS  ,EX  B  (200 » 

101  FORM  AT(  1615) 

102  FORM  AT( 8  P1C.3  ) 

201  FORMAT!  IX  1 161  51 

202  FORM  AT!  1X,SF1C.3  I 
CG  8  1*1 »N VARS 

TREAT  ( I  )=0 
8  CONTINUE 

READ ( 10 , 101)  MTREAT 
WRITE  19,201)  NT R EAT 

301  FORMATUX, 'NUMBER  OF  TR  EATME  NTS  •  ,15  ,  •  INDICES') 

READ  (10 ,1  Cl)  (TREAT!  I  I  ,1*1, NTREAT I 

WRITE  (9, 2C1)  (TREAT!  I  ) ,  I  *1,  N  TREA  T ) 

READ  (10,101)  NEXB 
WRITE  (  9 , 2C2  )  NEXB 

302  FORM  AT(1X.'NUMBER  OF  EX  BLOCKS',  15,'  INDICES') 

IF  (NEXB.G2.1)  RE  AD(  10,  101)  (EXB  (  I ).  1*1,  NEXB  I 
IF  (NEXB.GE.l)  WRITE  (9,  201)  ( EXB  ( I )  ,  1*1 ,  N  EXB ) 

CALL  NULIST 

*  (NELMS  .TOTELM, Li  ST,  NTREAT, TR  EAT  ,  LPTR  ,  PNVAPS  ) 

IF  (NEXB.Cc.il  CALL  EXBLOC(  LPTR,  TOTELM,Ll  ST, NTREAT, 

*  NELMS,  NEXB,  EXB,RNELMS) 

RETURN 

END 

SUBROUTINE  NULIST 

*  (  NELMS, TOTELM, LIST, NTREAT, TREAT, LPTR,  RNVARS) 
INTEGER  NI F, N  FILE,  IHCLD(  161  ,  TOT  ELM,N  TEST  ,  EXB(  200  I , 

*  NTRE AT, I P ( 50 )  ,NRP ,LP TR( 17) , N ELMS ( 16)  ,NULPTR(17) ,KP1, 
*RNVARS,NCF,  1 1  ,UL  ,  LL,  NVARS, TREAT (  i6) 

REAL  RP  (1C ),  LIST!  400  ) 

IF(NVARS. EC. NTREAT)  GO  TO  98 
CO  5  K*  1  , 1 7 
NULP TR  (K  I  *C 
5  CONTINUE 
1=0 

NULP  TR(  11*1 
CO  10  11*1  »NTFEAT 
I  IP1*II+1 
K*TREAT  (II  ) 

KP1*K+1 

LL*LPTP(K) 

UL*L  PTR ( KP  I )-  1 
DC  15  w*LL  ,UL 
1*1  +  1 

LIST!  I  )*L  1ST  (  J) 

15  CONTINUE 

NULPTRt  1 1  F II  *UL+  i-LL+NULPTR(  II ) 

10  CONTINUE 
TOTELM*  I 

WRIT  E  (9  ,  AC  8  )  (NULPTR  ( II  ,1*1 , 17) 

408  FORMAT!  IX, 'NULPTR' ,/ 1615) 

LPTR  (1)  *1 
CO  20  K*1  .17 

L  PTR  (  K  )*NU  LPT  R  (K) 

20  CONTINUE 
98  PNVARS*NTPEAT 
RETURN 

5N0_  _  _  _  _ 

SUBROUTINE  IxELOC 

*  (LPTR, TOTELM, LI  ST ,  N  TREA  T»NE  LMS ,  NEXB  ,EX3  ,  PNELMS) 


TREATMENTS' 


INDICES 


>  w  •  W  ^  ,  »» 

K*1 ,17 
ILP TR  (K  I  *C 
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INTEGER  MF»NFILE*IH0LD(16)  •  TOTE  LM,NTEST  .  EXB(200)  ,  I, 
*NTREAT, I P(50)  ,NRP  ,LP  TR(  17  J ,  NELMS  (  16)  ,£XI  (  200)  ,N£Xi,K, 

♦  RNELMSI  16)  »TEF1,NCF,  LL»  UL»NV ARS  •  TREAT  (16  )  ,  EXT  US,  J 
REAL  RP(  10)  t  L  1ST  (  400  ) 

IP  (  NEXB.EC.O  )  GO  TO  97 
K*0 

C  GET  INDICES  CF  LIST  ELEMENTS  TO  BE  EXCLUDED 
LPTR  ( 1 )  *  1 
CO  10  1*1 ,NTREAT 
DO  12  J* I* NEXB 
K*K  +  1 

EXi(K  )=LPTR(  I  )  +  EX6( J )-l 
12  CONTINUE 

10  CONTINUE 
NEXI  *K+ 1 
EXIINEXI  )*17 
K*1 
J*  0 

TEPi*TOTELM+l 

C  RELOAD  LIST,  EXLUCING  ELEMENTS  AS  INDICATED  BY  EXI 
DO  20  1*1 » TEP 1 
E  XTH I S*£X I (K) 

I  F  (  I.LT.E>THIS)  J*J+1 

IF  (  I  .LT »E >TH I S )  LIST( J) =LIST ( I ) 

IF  (I.GE.EXTHIS)  K-K*l 
20  CONTINUE 

C  CHANGE  LIST  PC  INT  ER,  NUMB  ER  OF  ELEMENTS 
LPTR  (1)  *1 

TOTE  LM-TCTcLM-N£XB*N TREAT 
CO  25  I *1 »NTREAT 

R  NELMS  ( I )  *  NELMS (T  PEAT  (I)  I  -NEX  B 
25  CONTINUE 

NT  PI  -NTREAT+1 
DO  30  I  * 2  » NTP 1 

LPTR  (  I  )*LP  TR(  I  )-(  I-l )  *NEXB 
3  0  CONTINUE 

LL*NTREAT+2  , 

CO  40  K  =  LL ,17 
LPTR  I  K)*0 
40  CONTINUE 
GO  TC  99 

97  00  98  1*1  »NTREAT 

RNELMS  (  I  )*NELMS (TREAT  ( I )  ) 

98  CONTINUE 

99  RETURN 
END 

£**«** ************* 4** **** ************ ******** *4***4****** ** 

C  THIS  SECTION  CONTAINS  THE  SUBROUTINES  CALLED  BY  MCRE  THAN 
C  ONE  TEST  SUBROUTINE.  THEY  APPEAR  IN  ALPHABETICAL  CRDER: 

C  EXPF  ,  FINDS,  FINOT,  MUVAR,  NCRMF,  NORMV ,  PAR  COM ,  RANQUE 
C  RHOER,  RNQ,  RFARAN 

SUBROUTINi  EXFFILIST  .N.RP.F.NQF) 

THIS  SU3R0LTINE  FINDS  THE  THEORETICAL  EXPONENTIAL 


C 

C 


OISTIBUTICN  F.  WHERE 
REAL  A,L.*,LIST(4C0J 
INTEGER  I 

l-rpTd 


L=RP(1 ) -SCALE 
»RP  < 50)  ,  F(400) 


PARAMETER  LAMBDA 


00  10  I  - 1  »N 
X*LIST ( I) 


10 

101 


) 

.0) 


F(  I)  *0 
F(  11*1 


Q-EXP(-(L*X) ) 


IFCX.LE.O 
IFtX.GT.O 
CONT  INUE 

WRIT E(NOF  ,101 )  L 
FORM  ATI  IX, 'WITH  HYPOTHESISED 


nn  a*  1 1  i  ^  ?  ni 

1  EX  PQNENT  IAL  */!X  ,'W  ITH  PARAMETER 


RETURN 


DI  3TRIBUTUI  CN  •  , 

LAMBDA* ',F1C. 4) 


16  9 


v  < 


C- 

c 

c 

c 

c 

c 


SUBR  OUT  I  NE  FINOS(LlST,LPTR,S,K) 

THIS  SUBROUTINE  FINOS  THE  DIFFERENCES  BETWEEN  TWC 
EMPIRICAL  COFS  FOR  THE  TWO  VECTORS  CONTAINED  IN  LIST 
1ST  VAR  X  HAS  *  ELMS  M  ANC  COUNTER  I;  20  VAR  Y  HAS  #  ELMS 
N  ANC  COUNTER  JJ  E  CDF  ARE  (I-D/M  ANO  (J-U/N;  STEP 
COUNTER  OF  SMALLER  VAR  UNTIL  N+l  OR  M+l,  THEN  INCR  OTHER 
REAL  EPSILN.FN.FM.X <1001 ,Y( 1001 ,DIF, ADI F,S< 200) , 

♦  ATMAX,ATMIN,T  SUP,  PDI  F  ,  LI  ST(40  01  ,RP<50)  ,TMAX  »TMIN 
INTEGER  I  ,J,  K,L  ,M^N  ,KEYI  (100),KEYJ{  100 )  ,  JJ  ,  I  P<  50 ) , 

*  L  PTR  ( 17)  ,N  Cr,  TRcA  T (1 7 ) , 1 1 
E°SI LN=0 *0001 

M*LP  TR( 2 ) -1 
N=LPTR( 3 )-LPTR(2) 

FN*FLQAT ( N  J 
FM=F  LCAT  ( N  I 
DO  10  1=1, M 
KEYI  (  I  )  =  I 
X< II =L I ST< II 
CONTINUE 

CALL  SHSCRT  (X  ,KEY  I,M  ) 

00  12  J=1,N 
K EYJ  (  J  J=J 
Y<  JI«LIST<  (M+JI) 

CONTINUE 

CALL  SHSCRT<Y,KEYJ,N  I 
K=0 
L=0 
1=1 
J=1 

CONTINUE 

If  <  (I.GT.M).ANO.(J.GT.N))  L*1 

FORMAT<  IX, 'ITERATION  *  ,  I  ,  J,  STOP?  '  ,41  5  ,  •  DIF:',F10.4) 
FORM  AT  (IX, 'SIX):  «  ,F10.4I 
II  »I 

IFU.GT.M)  1 1  *M 
JJ*J 

IF(J.GT.N)  J  J  *N 
0 1  F=  X  <  1 1  I  -  Y  ( J  J I 
AOIF  *ABS ( C  IF ) 

IF(AQIF.G7.EP£ILNI  GC  TO  25 
IF(I.LE.M)  I* I ♦! 

P  DIF* FLOAT  (1-1 l/FM-FLGAT ( J-l I /FN 
K  *K+  1 
S (Kl *PC  IF 

WRITE  <  9 , 1  Cl  I  K  ,  I  ,  J,  L ,  DI F 
IF  (K.GT.O)  WRITE  <9,1111  S(K) 

IF(J.LE.N)  J*J+1 
K  »K+ 1 
S(K)*PCIF 

WRITE  ( 5 , 1  Cl  I  K,  I  ,  J,  L ,  DIF 
IF  (K.GT.O)  WRITE  (9,111)  S(K) 

IF(L.EC.l)  GO  TO  30 
GC  TO  20 
CONTINUE 

I  F((  DIF.LT.O)  .AND.  (I  .LE.M) )  I  *1  +  1 
I  FUDIF.GT.OI  •  AND,  (J.LE.N)  I  J*J  +  1 
K  =K+  1 

S  (K)*FLCAT  <1-1  »/F  M-FLOAT (  J-l )  /FN 
WRITE  <9,1011  K,I,J,L.OIF 
IF  (K.GT.O)  WRITE  <9, Ill)  S(K) 

I  F<(  DIF.LT.O)  .  AND.  ( I.  GT.M). AND.  (J.LE.N)) 
IFHDIF.GT.O).  ANQ  •  (J.GT.N).AND.  (I.LE.  Ml) 

IF(L.LT.l)  GO  TO  20 
RETURN 
END 


10 


12 


20 

101 

111 


25 


J*J+i 
1*1  +  1 


30 
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SUBROUTINE  F I  NOT ( S»F  »N,  T  SUP  ,  TMI  N  .TMAX  ,NuF  I 
C  THIS  FINCS  THE  DIFFERENCE  IN  CDF*S  OF  THEORETICAL 

C  DISTRIBUTION  F  ANC  EMPIRICAL  DIST  S  AT  THE  X*S  DETERMINED 
C  BY  THE  VALUES  CF  THE  SAMPLE  DATA  VARIABLE  ELEMENTS.  Tr»E 
C  MAX  ABSOLUTE  CIFFERENC  IS  TSUP.  T  IS  VECTOR  OF  DIFFERENCE: 
REAL  S(  400  » F  ( 40  0 1 ,  T  S  UP  , ATMI N.ATMAX , TMIN , TMAX  ,T(  800  > 
INTEGER  N  •  It  I  I»NGF 
NP1*N+1 
T (  1)  *F(  1 l 
T ( 2) *FI1I-SI1 I 
CO  10  1 1  *2  tN 
1*11-1 
I  T2*  (  1 1*2  I 
I  T 1*  I  T2-1 
T  (IT1  )*F(  I  1I-S(II 
T  ( IT2  I  *FC  I Il-S  ( II  » 

10  CONTINUE 
TMIN*2. 0 
TMAX *-2. C 
CO  12  J  *  1 1 IT2 

IF(T{  JI.GT.TMAX)  TMAX*T(J) 

IF(T(  Ji.LT.TMIM  TMIN=T(JI 
1 2  CONT  INU  E 

ATMAX*ABS(TMAXI 
ATMI  N*ABS(TMIM 
TSUP*ATMIN 

IF4ATMAX.GT.T  SUP  I  TSUP*ATMAX 
WRIT  E(NO  F  1 101  I  TMAX. TMI N. TSUP 
101  F0RMAT(1X, 'MAXIMUM  DIFFERENCE,  F  (Xl-S  (XI  ,  *♦  F1C.  4/1X, 

*•  MINIMUM  DIFFERENCE  : 1 ,  F 10.  4/1X  » 

♦  •LARGEST  AESOLUTE  01  FFERENCE  :  • , F  10. 4  I 
RETURN 

END _  ^  _ 

. . .  I U 8R  0  UT  7  N i ~  MU  VAR  7  III  T  ,  nT  M U ,  V  A rT ”  ” 

C  THIS  SUBROUTINE  FINDS  THE  SAMPLE  MEAN  AND  VARIANCE 

REAL  LIST  (400  I, MU  ,VAR,XBARf  SUMSQ 
INTEGER  N 
XBAR*0.  0 
SUMS  C*0  •  0 
DO  10  1*1,  N 

XEAR*XE AR-»L IS T(  I) 

10  CONTINUE 

MU=X8AR/FLCAT (Nl 
CO  2  0  I  *  1  ,N 

SUMSQ*SUMSC+( LIST (I»-MUI**2 
2  0  CONTINUE 

VAR*SUMSC/FL0AT(N-1I 
WRITE  (9,1011  MU, VAR 
WRITE  (NQF.101I  MU.VAR 

101  FORM  ATI  1 X  , '  ESTIMATED  MU*  ,F10.3,  »  VARIANCE  *,  F10 .31 
RETURN 

END  _  __ 

SUBROUTlNi"NOFMFlLIST  ,nTrP,F  ,NOfT 
C  THIS  SUBROUTINE  FINDS  THE  THEORETICAL  NORMAL 

C  01  ST  I  BUT  I CN  F,  WHERE  MU*RP(  1 J  ,S  I  GMA  SCUAR  ED*RP(  2  I 

REAL  PX,MU,S!GSQ,X,LIST(400!  ,RP(  501  ,F (40  0 1 
INTEGER  I 
MU*R  P  (1  I 
SIGSQ*RP  (  2  I 
CO  10  1*1, N 
X*LI S  T ( I J 
Z  *(X-MU  I/S  1GSQ 
CALL  NCPMV  (1 ,  Z  ,PX  I 
F  ( II  *PX 


171 


10  CONTINUE 

UR  IT  E(NO  F  ,  101  I  MU, 31  GSQ 

101  FORMAT!  IX, 'WITH  HYPOTHESISED  DIS  TRIBUTUICN  NORMAL  '/i  X , 
♦  •WITH  PARAMS  HJ*  1  ,F1  0.4,  •  ,  AND  SIGMA  SQUARED® 1 ,F10.4 i 
RETURN 
END_ 

subr  out  I  nI"*ng  fmv  Tgpt  Ion  ,  zTp  ) 

C  IF  OPTIONS,  THIS  TAKES  A  Z  ( STAN  CARD  NORPALJ  ANO  RETURNS 
CAP,  IF  OPT  I 0N*2 »  TAKES  A  P  AND  RETURNS  A  Z 
C  NUMERICAL  INTEGRATION,  TRAPAZCIDAL,  STEP*0.1  STD  CEV 
C  INITIALIZE  SCTH  Z  AND  P  IN  CALLING  SUBROUTINE 

REAL  Z,  P  ,P  I,  CCNST,QLO,  NEW,  ZZ.PP,  CUMP  ,  X,  QELTP ,  EXCESP , 
*D£LX 

INTEGER  OPTION, I  ,J,UL, ONE 
FI-3 .1415527 
CUMP  *0 

CONS  T-i .C/(2*FI)**Q. £ 

CLD* CONST 

IF  ( CPTICN.EQ.2)  GO  TO  50 
ZZ=A8S(Z) 

U  L-I F IX (ZZ*10.0) 

00  10  1*1, U 

X»FLOAT< I  )*0 .1 
NEW»CCNST*EXP(-0.5*X*X) 

CLMP-CUMP+O. 0£*< OLD+NEW) 

CLC-NEW 

IF  (CUMP. GT.  0.48)  GO  TO  12 
WRITE  (9,104)  I, NEW,  CUMP, X 
104  FCRMAT(1X,»I  ♦  ,15,*  NEW, CUMP,  X*  ,3F7.3  ) 

10  CONTINUE 

DELX=22-X 

N  EW-CCNST*  EXP ( -0. 5*ZZ*ZZI 
CUMP=CLMP+C.5*DELX*rOLD+NEW) 


C.5-P 


C UMP*CUMP  +  C.5*DEL  X*(OLD+NEW) 

WRITE  ( 9 , 1 C4)  1, NEW, CUMP, X 
P-CUMP+C.5 

IF  (Z.LT.O.O)  P-0.5-CUMP 
GO  TO  95 
CONTINUE 
PP-P-0.5 
X»0.  0 

IF  (P.LT.0.5)  PP-C.5-P 
X-X+O.I 

NEw*CCNST*EXP(-0.5*X*X) 

OELTP  *0.  Q5*(  CLD+NEW) 

C CMP -CUMP *DE  LTP 
CLO-NEW 

WR  ITE (9, 1C9)  X, NEW, CUMP 
FCPMATilX , '  X  , NEW, CUMP* , 3F7.3) 
IF  (CUPP. LI. PP)  GC  TO  60 
EXCESP*CUMF-PP 
ZZ»X-0.1*£XCESP/DELTP 

z-zz 

IF  (P.LT.0.5)  2— ZZ 
WRITE  (  9 , 1  Cl )  OPTION, P,Z 
FORM  AT!  1  X  ,  •  OFTION,  PROB,  VAL  OF  Z;*,I! 


FORM  AT ( 

RETURN 

ENO 


OFTION, 


$ROB, 


VAL  OF  Z:»  ,15,  2F10.3) 


SUBROUTINE  PAFCOM  (V,  T  ,S  ,  NCF  ) 

C  THIS  FIND  THE  DIFFERENCES  BETWEEN  THE  ITH  ANO  JTP  ELEMENT 
C  OF  VECTOR  V,  FCR  ALL  IOJ,  1,...,T;  PRINTS  THE  DIFFERENCES 
C  ANO  THE  SUPPLIED  STANCARD  DEVIATION  S 
REAL  V(  1  7  )  ,S 

INTEGER  NCF,T  ,1,  J  ,TM  1  ,IP1 
WRIT E (NQF  ,101  )  S 


FORMAT!  9X, '//PAIRWISE  C  OMPAR  ISON  S//  •  /  IX, 


ONLY 


nnooonnonooonnoon 


*«  IF  THE  MILL  HYPCTHESIS  IS  REJE  CTED  •  /IX ,  •  THESE*, 

*•  COMPARISONS  HAVE  MEANING,  COMPARE  THE*, /IX, 

*•  OESIREC  CUANTI  LE  OF  THE  T  DISTRIBUTION  • /IX  , 

*•  TIMES  The  STANDARD  DEVIATION  *,F10.4,/1X, 

*•  WITH  ThE  FOLLOWING  ABSOLUTE  DIFFERENCES  IN  THE'/IX, 
*•  SUMS  OF  ROW  RANKS:  *//lX, 

*•  I  J  ISRm-SRlJJl  •/» 

TM1*  T-l 
CO  10  1*1, TM1 

i  pi* i 

DO  20  J*I P  1  »T 

x*v(  i  )-v( ji 

AX*AB SIX) 

WRITE  (NOF  ,102  >  I,J,AX 
102  FCRMA1(1X,2I5,3X,F10»4) 

20  CONTI KLE 

10  CONTINUE 
RETURN 

EN0_  __  ______ 

~~”IuBR  OUT  I  nI~RANQuI7nV  ALS  ,  VALS  ,SVALS»RANK,  Q  ,k!y  ,NTREAT  , 

*  NB  LOCK  ,  CRT  I  CNI 

DIMENSION  VAL  SIN  VALS  ) ,Q(200)  ,RANK(NVALS!  ,RANQ(400J  , 

*  SVAL  S(NVALS)  ,  KEY  (  NVA LS )  ,  JJV 1 400  )  , RANGE (20  01 ,  QKEY  ( 200  ) 
INTEGER  KEY, NEWT  IE,OLDTIE,LL  .NTREAT , NBLOCK, OPTION, 

*  JJV,NL,UJ,QWEY 

REAL  DIF,  VALS  ,SVALS,  RANK1  ,R  ANK,  A  VE  ,E  PSILN  ,RANC,  Q ,  RAN  GE 


THESE 


SUBROUTINES  RANK  VALUES 


Q 

DIF 

EPSILN 

NEWTIE 

OLDTIE 

KEY 

RANK 

LL 

AVE 

RANG 


THE  RANK  CF  RANGES  WITHIN  BLOCKS 
DIFFERENCE  BETWEEN  KTH  AND  K  +1TH 
LN  TOLERANCE  CF  RCUNQING  ERROR,  I.E. 

SAY  TWO  VALS  ARE  TIED 
IE  KTH  ANC  K-HTH  SORTED  VALUES  ARE 
IE  K-lTh  ANO  KTH  SORTED  VALUES  ARE 
ORIGINAL  INDEX  OF  V  ALS,  E  .G.  ,  KEY  ( 
OF  SMALLEST  VAL,  KEY(NVALS)  HAS  I 
RANK  CF  TFE  UNSORTED  (ORIGINAL! 
LOWEST  INCEX  CF  TIED  SORTED  VALU 
AVERAGE  OF  LL  &  HIGHEST  INDEX  OF 
)  IF  0PTI0N»1, TOTAL  SET  OF  RANKS, 
=3  RANKS  W/I  BLOCK  PLUS  RANKS  OF 

IF  (OPTION. GE.2J  GO  TO  51 

NL*N  VALS 

CO  10  1*1  ,NVALS 

S  VAL  S ( I  I* V  ALS (I) 

KEY( I  1*1 
CQNT  INUE 

CALL  RNQ  (SV ALS,  KEY,  NL,RANQI 
CO  15  1*1  ,NVALS 
RANKC  I)»RANQ( I ) 


SORTED  VALUE 
, CLOSE  ENOUGH  TO 

TIED  (LOGICAL  INTi 
TIED  (LOGICAL  INTI 
1)  HAS  ORIG  INDEX 
NDEX  CF  LARGEST  VA 
VALUES 
ES 

TIED  SORTEO  VALUE 
*2  RANKS  W/I  BLOCK 
RANGES  OF  BLOCKS 


10 

CQNT  INUE 

CALL  RNQ 

CO  15  1*1 

RANK(  I 

15 

CONTINUE 

GO  TO  95 

51 

NL*NTREAT 

00  20  1*1 

20  1*1  ,NBLCCK 
QKEY  (11*1 
DC  25  U*1 , NL 

JJ*I«-(J-i)  *NBLOCK 
SVALS(  J  )*VALS(  J  J  ) 

JJ  V ( J! **J 
KEY  (  J  )*  J 
CONTINLc 

CALL  RNC  (  SVALS,KEY,NL,RANQ! 
RANGE  (  I  )*S  VALS  (NL  )-SVALS(  1! 
WRITc  ( 9 , 1 C7)  (RANC(K)  ,K*1,NL  I 


107 


30 

20 


'RANG ( J  ) 


FORMAT  (IX ,  7F10.3J 
DC  30  J=1,NL 

RANK ( 4JV ( J  ) ) 
CCNTIMe 
CONTINUE 
IF  (  OPTION. EQ.3I 
RETURN 
ENO 


95 


CALL  RNG( RANGE, GKEY ,  N8L0CK,  G I 


SUBROUTINE  RHCER  (  X  iNPAI  R  »RHQ  ,81  ,  BO, OPTION  J 
THIS  SUBROUTINE  TAKES  X,  A  VECTOR  COMPQSEO  QF 
.ENGTHED  VARIABLES,  AND  FINDS  THE  CORRELATION 


TWO  EUUAL 
COEFFICIENT 


10 


c  _ 

C  LENGTH  _  _ 

C  RHO  (PEARSON  *S:  COVARIANCE/  STDE  Vi*ST  DEV2  ) 

REAL  X(  400  ) »RFO»  S  SA,  SSB,FN,A8,DA  ,DB, AMEAN  ,6ME AN, 
*81,80,RHC,CENCM 
INTEGER  NPAIR, OPTION 
FN*F  L0AT ( NFAI R ) 

AMEAN-O*  C 
GM  EAN=0  •  0 
SSA*0.0 
SSB*0.0 
ABO  .0 
Bl-0.0 

eo*o.o 

CO  10  1*1  ,NPA  IR 
AMEAN*AMEAM-X(  I) 

BMEAN*BMEAN+X  (  ( H-NPAIR  l  i 
CONTINUE 
AMEAN=AM  EAN/FN 
EMEAN*BM  EAN/FN 
DO  20  1*1, NPAIR 
OA*X(  Ii-AMEAN 
D B*X (  (  I-»NP AIR  )  I-8MEAN 
A8*AB  +  (QA*C3) 

SSA*SSA  +  (DA*OA  ) 

SSB*SSE*(OE*OBI 
CONTINUE 

DENOM*(  SSA,M,0.5I*(SS8**Q*5) 

RH0*A8/0  ENCM 

IF  (OPTION. EQ.l)  61* AB/SSB 
IF  (  GPTICN.EQ.l)  BO*  AMEAN-B1*BME AN 
IF  (  CPTICN.EQ.2)  81*AB/SSA 
IF  ( OPT ICN.EQ  .2)  BO* BMEAN-B1*AME AN 
WRITE (9  » 122 1  FHO.EO,  81 
FORMAT!  IX  ,  •  RFO,  8C,B  1:*  ,3F10.4I 
RETURN 
ENO 


20 


122 


,KEY ,NL , RAN  0 ) 

AND  RANKS  A  VECTOR  FROM  RANGUE 
ARE  UNSORTEC  VALUES  WHICH 
DETERMINE  RANKS  i  KEY  IS  THE 
ELMS  IN  SVAL;  RANQ  IS  RANKS 
START  ANO  ENO  TIED 


SUBROUTINE  RNC  (SVAL 
THIS  SUBRCUT I FE  SORT 
OR  ANY  OTHER  SUBRCUT INEi SVALS 
WHICH  ARE  RETURNED  SORTED  AND 
VECTGR  1  »••  •  , NL  WHERE  NL  IS  * 

OLOT  IEtNEWTI  E  ARE  LOGICALS  TO 

REAL  SVALSINL  >,RANQ(  NL)  ,EPSILN, 

♦AVE, DIF 

INTEGER  M»I»  LL,  IP1,  NEWTIE,OLDTI  E,KE Y( NL  )  ,K 
EPSI  LN-O.QOOl 
WRITE(9,*22I 

FORM  AT  ( 1  OX  »•♦***  START  RANKING  IN  RNQ  ****•) 
CALL  SHSCRT (  SVALS  *KEY  »NL> 

CLOT  IE*0 
CO  10  I  *  1 ,  N  L 


VALUES 


222 


c 

Cl  04 
C  * 


C 

107 
Cl  08 


K  *KE  Y  4 1 1 

R  ANQ ( K  )»FLCAT (  I) 

I F  {  I  .GT.l  )  WRITE  (9, 1041  I , K , RANQ <KJ , RANQ <1 ) 
FORMAT  (IX, 'I. K  ',215,'  RANQ(  K)  *,F10.3, 

'•  RANC<  I  )•  ,F1C.3  ) 

I P1«I*1 

DIF*SVALS(  IPl)-SVALS(I) 

IF  (  I  .EC. Nil  OIF-1.0 
IF  <  DIF.LT.EPSILN  )  NEWTI E=1 
IF  (NEWTIE.EQ.GLOTIE)  GO  TO  29 
IF  (NEHIE.LT. CLDTIE)  GO  TO  15 
Ll*I 

C-C  TO  29 

AVE*0.5*(F10AT(I+LL)  I 
00  20  J*LL  .1 

RANG  (  KEY  ( J  ))  «AVE 
CONTINUE 
CLOT  IE*N  EWTIE 

WRITE  ( 9 , 1C7 )  (RANC(K)  ,K=1,NL  ) 

CONTINUE 

W  R  IT  E  ( 9  »12 1 1 

F  ORMAT ( IX ,  '  SORTEC  VALUES') 

WRITE  ( 9  , 1 C 71  (SVALS(K)  ,K=*1,NL) 

WRITE  (9.122) 

F ORMAT ( IX  •  '  RANKS  CF  VALUES  ') 

WRITE  (9,1C7)  (RANQ(K)  ,K*1,NL) 

WRITE  (9.1C8)  (K£  Y (K ) , K=1  ,NL ) 

FORMAT  (IX,  7F10.3) 

FORMAT  (IX,  7110) 

RETURN 
END 


SUBROUTINE  RP  ARAM  (NI  F  ,NRP,I  P  ,RP  ) 

C  THIS  REAOS  THE  INTEGER  ANO  REAL  PARAMETERS  OF  1 
C  FILE  FOR  EITHER  M STROP  OR  CELLER 
INTEGER  MP,NFP,IP(50) 

REAL  RP(1C) 

101  FORMAT(  16  15  ) 

102  FORMAT(8F10.3) 

FEAO  ( 10 , 1C  1 )  MP 

IF  (NIP.GE.l)  REAC(1 0,101)  (  IP ( I  ), 1*1, NIP) 
READ  (10, 101)  NRP 

IF  (NRP.GE.l)  RE  AC  ( 1  0 , 1 02  )  (  RP(  I  ) ,  I*  1  ,NRP  ) 
RETURN 
ENO 


OPTIONS 


SUBROUTINE  TH  YPA  (  X  ,NCF) 

C  THIS  WRITES  TFE  RESULT  FOR  A  KOLMOGOROV,  LILLIEFORS 
C  TEST  TO  THE  CLTPUT  FILE  WHEN  THE  HYP*1 
WRIT E(NOF  ,  101 )  X 

101  FORM  AT(  1 X  , '  TFE  NULL  HYPOTHESIS:  F*(  X  )  =  S  (  X)  '  /  IX, 'THE  • 
*«  TEST  STATISTIC  IS  LARGER  OF  THE  ABSOLUTES  VALUES'/ 

* 2X , '  OF  TFE  TWC  DIFFERENCES:  '  /IX,  F10.  A) 

RETURN 
ENO 


VALUES'/ 


C  THIS 
C  TEST 


SUBROUTINE  TH  YPB  (  X  »N  CF) 

WRITES  THE  RESULT  FOR  A  KOLMOGOROV, 
TO  THE  CLTPUT  FILE  WHEN  THE  HYP=2 
WRIT £<NQF  ,101  )  X 

FORM  AT(  IX.  'THE  NULL  HYPOTHESIS:  F*(X 
•  TEST  STATISTIC  IS  LARGEST  OIFFEREN 
2X  ,*  OF  F*OI ) -S(  XI) :  '/1X,F10.4) 
RETURN 
ENO 


LILLIEFORS 


HYPOTHESIS:  F*(  X) <=S ( X) • / IX  ,  • THE • 
LARGEST  DIFFERENCE  '/ 

•/1X,F10.4) 


SUBROUTINE  THYPC(X,NOF) 


c 

c 


CR  A 
WHEN 


KOLMOGOROV  * 
THE  HYP=3 


LIL  L IEFORS 


HYPOTHESIS:  F*(  X  )  =>S  (  X)  ' / IX, 
SMALLEST  (OR  MOST  NEGATIVE)' 
• /1X.F10.4) 


•THE 

/ 


c 

c 

c 

c 

c 

c 

c 

c 


THIS  WRITES  THE  RESULT  F 
TEST  TO  THE  CLTPU7  FILE 
WRITE(NGF,1011  X 
101  FORM  AT(  1  X  * '  TH  E  NULL 

*  •  TEST  STATISTIC  IS 
*2X  *•  OF  F*(XI  )-S<  XI): 

RETURN 
END 

(!♦♦*♦  *****  ****  *  *  ***  4**  ♦  #♦*  ************  ********  *4*********** 

C  THIS  SECTION  OF  THE  PROGRAM  CONTAINS  THE  TEST  SUBROUTINES 
CIN  THE  ORDER  OF  THEIR  TEST  NUMBER;  TESTS  ARE  SEPARATED  BY 
C  -/-/-/ 

SUBROUTINE  BI  NTST  ( NOF  ,RCCT,  R  P,  I P  ,  ALPH  AT,  C  ILL  ,  CIUL  ) 

THIS  MASTER  SUBROUTINE  FINOS  ALPHA  HAT  OF  A  BINOMIAL  TEST 
BASED  ON  THE  TOTAL  NUMBER  N,  THE  NUMBER  OF  SUCCESSES  T, 

THE  HYPOTHESIZED  PROBABILITY  PS  T  AR,  A  NO  FORM  Cf  HYPOTHESIS. 
THE  NULL  HYPOTHESIS  IS  EVALUATED.  IT  THEN  FINDS  LOWER  AND 
UPPER  CONFIDENCE  INTERVAL  BOUNDS  <CILL,CIUL)  FOR  ACTUAL 
PROBABILITY  BASED  ON  THE  NR  CF  SUCCESES  AND  FAILURES. 

IF  N <20,  EXACT  BINOMIAL  COMPUTATIONS  WILL  BE  PERFORMED, 
OTHERWISE  THE  NORMAL  APPROXIMATION  WILL  e£  USED. 

INTEGER  NOF,  IP(  50),  RCCTI50)  ,N,T,HYP 
REAL  ALPHA, AL PH AT, C IUL,CILL ,RP( 50 ) , PSTA R  ,ALPhYP , 

*  NKV  ( 21 ) 

T*RCCT(  1  ) 

N=RC  CT( 2 )  +T 
PSTA  R*RP  ( 1 ) 

ALPH  YP=1  .C-RP  (2 ) 

HYP* IP( 1 » 

WRITE  (  NCF  , 20  1)  N  ,T,  PSTAR ,ALPHYP 
FORM  AT ( 1  0 X  ,  •  B  INOM  IAL  TE ST  :  •  /  IX,  '  OF  • ,  1 5 ,  • 

*  'OF  DATA  SET', 15,'  WERE  SUCCESS  ES*  /  IX , 

*  •  THE  HYPOTHESISED  P  STAR  IS:',F10.5/ 

*1X,'  THE  LEVEL  OF  SIGNIFICANCE  IS:',F10. 

IF  (HYP. EC. 1)  WRITE  (NOF, 301) 

FORMAT!  IX, 

*  •  THE  NULL  HYFOTHESI 
IF  (HYP.  EC. 2)  WRITE 
FORM  AT( 1 X  , 

*  '  THE  NULL  HYFOTHESI 
IF  (HYP. EC. 3)  WRITE 
FGRMATI1X, 

*  •THE  NULL  HYPOTHESIS  IS  P>*P  STAR,  ALT  IS  P<P  STAR') 

IF  (N.GT.20)  C-0  TO  30 

CALL  B  INH2  CIN, T, PSTAR,  HYP ,  ALP  HAT  » NKV ) 

CALL  CIBINC(N,  T,ALPHYP  , NKV, Cl  LL,CIUL) 

GO  TO  S9 
CONTINUE 

CALL  BINOPMINjT, PSTA R , HYP, ALPHA T , ALP HYP  »CILL,CIUL) 
CONTINUE 

WRITE  (9,101)  ALPHAT  ,CILL,CIUL 
WRITE  (NOF, 101)  ALPHAT, CILL,C1UL 


201 


301 


302 


3  03 


30 

99 


ELEMENTS  ', 


5) 


S  IS  P*P  STAR, 
(NOF, 302  I 


ALT  IS  POP  STAR'  ) 


S  IS  POP  STAR,  ALT  IS  P> F  STAR') 
(NOF, 303) 


101 


ALPHA  HAT,  Cl  LOWER,  UPPER' ,  3F10. 21 


FORM  ATI  l  X  ,  ■ 

RETURN 
END 

c- 

SUBROUTINE  BINORM 

*  (N  ,7,  PS  TAR,  HYP  ,  ALPHAT  ,  ALPH  YP,C  ILL  ,  C IUL  ) 

THIS  SUBROUTINE  FINOS  THE  ALPHA  HAT  CF  A  BINOMIAL  GIST 
ANC  THE  CONFIDENCE  LIMITS  OF  THE  SAMPLES'  PRQ8A8H  TY 
CALLING  THE  SUBROUTINE  NORMV 


ALPHYP  THE  ALPHA  SPECIFIED  IN  TEST  I. O-LEVEL  OF  SIGNIF 
ALPHAT  THE  ALPHA  RETURNED  BY  THE  NORMV  SUBROUTINE  BASED 
ON  THE  NUMBER  OF  SUCCESSES  AND  TOTAL  SAMPLE 
ZHAT  THE  EMPIRICAL  l  =  STANDARD  NORMAL  N  SUCC  MINUS 

E ( N  SUCCESS )*( N*PSTAR) ,OIV ICED  eY  STD  DEV  (NPQ ) 
ZHYP  Z  RETLRNEC  FRCM  NORMV  BASED  CN  ALPHA*  OF  HYP 


*.\v> 


176 


non 


.'-A A 


r^rrrarr^r 


INTEGER  N ♦ T»H  YP 

REAL  ZHAT ,  Z  »ALPH»  ALP  HAT  ,  CILL  ,CI  UL  »NP ST  ,S TDEV  , ZHYP 
NPST*FLQA7(NJ  *PSTAR 

ZHAT  *(FLCAT(T  )-NPST)  /  l(  NPST*  (l.O-PSTAR)  ) **0.5 ) 
CALL  NOR  MV (1 »  ZHAT  »AL  PHAT I 
IF  (HYP.G7.il  GO  TC  20 

IF  (  ZHAT.G7.0)  AL PHA T= ( I . O-AL PHAT ) *2. 0 
IF  IZHAT.LE.O)  AL  PHAT* ALP  HAT  *2.0 
GC  TO  2C 

20  IF  (  HYP  .  EC .2  i  ALPHAT  =  1.0-ALPHAT 
30  STDE  V  =  ( FLCAT ( T*( N-T)  )/( ( FLOAT (N I  1**3)  1**0. 5 
CALL  N0RPVC2, ZHYP.ALPHYP) 

TD IV  N*FLC AT  (T  I/FLOAT  <N> 

CILL*TDI  VN*ZHYP*$TDE  V 
CIUL=TDI  VN-ZHYP*STDE  V 
RETURN 
END 


10 

101 

102 


SUBROUTINE  NCFUZK (N,  NKV ) 

THIS  SGUSRCLTINE  PRCDUCES  THE  NKV  VECTOR  OF 
FOR  K-  0 1 . . .  THE  INDICES  OF  NKV  ARE  ONE 
HIGHER  THAN  THE  CORRESPONDING  K,E.G.,I=1,. 
REAL  NUM,C£NQP»NKV(21) 

INTEGER  ULfMECiNPltl  P1.NP1MI  ,N,  I 
ISP1=N+1 

LL*I  F IX  (MO.  SI 
PED* IFIX  ( NP1 <0. 5 ) 

NKV( 11*1 
NKV(  NP1 1*1 
NUM* 1 .0 
DENO  M*1 . C 
CO  10  1*1. UL 
IP1*I+1 
NP1MI*NP1-I 
NUM* NUP  *FL  CAT  (NP1MI ) 

D £NOM*CENC P*F LCAT  (I) 

N KV(  I  P  1 1  *NLM/DENO  P 

I  F<  Z • LT  . - 

CONTINUE 
WRITE  (9.1011 

(IK,'  b  CHOOSE  K  VECTOR') 

9.1C2I  (NKV(I  I  .1=1  »NP  1) 

3F10.3  I 


N  CHOOSE  K 


•  •  t 


N  +  l 


.MECI  NKV(  NP1MI  )*NKV(  IP1) 


FORMAT 
WRITE  ( 
FORM  AT( 
RETURN 
END 


SUBROUTINE  8If>PMF<P,N»NKV»VECLIM»PQV»PMF,  CUM  PR  ) 

C  THIS  SUBROUTINE  RETURNS  THE  VECTOR  PGV  CCRR  E  SPONCiNG  TO 
C  P**K  TIMES  Q**(N-K)  FOR  K*0 , . .. . VECL IM.  AND  PART  OR  ALL 
C  OF  PMF  FOR  BINOMIAL.  LET  VECLIM  =  N  TC  PRODUCE  ENTIRE  PM F 
C  FOR  LINESEARCH,  IT  WILL  BE  LESS.  VECTOR  NKV  FROM  NCHUZK 
C  CUMPR  IS  CUMULATIVE  PROBABILITY 
INTEGER  I  .VECLIM, N.K  ,UL 

REAL  P,Q,PCV(  21)  ,  PVl  21)  , QV(  2  1 ) ,  PHOLD  ,  QHOLD, NK  V(  21) , 

♦  CUMPR.DI  VER  ,  P  C  .PMF  (2  1 ) 

LUMP  R*0 . C 
C*1.0-P 
LL*V  ECLI M+i 
NP1*N+1 
CIVER*P/C 
*RITE<9» 1C2) 


4C-, 


_ _ _  F.O.PQ, DIVER, VECLIM 

103  FORM AT( 1 X . ' P, C.PQ.DI VER, • ,4F 10.  7 , 
PQ«Q**N 
DO  20  1*1, UL 
PCV{ I )*PQ 
PC*PQ*CIVEF 
P  MF(  I  )*PQV  (I)*NKV( I) 


•VECLIM*  ,15) 


177 


CUMPR  =  CUMPF+PMF(I  J 
20  CONTINUE 

WRIT  E  (9,1011  VECLIM,  P,CUMPR 

101  FORMAT  (  IX, 'VECLIM',  15, 

*  '  P',F8.3,'  CUM  PR0B:',F8.3, 

WRITE  (9,102)  (PQV(I  )  ,1  =1  ,NP  1) 

102  FORM  AT( 8  F10.3  ) 

RETURN 

END 

SUBROUTINE  BINH20  (N,  T  ,  PSTAR ,  HYP , 
C  THIS  FIGURES  THE  ALPHA  HAT  VALUE 

C  VALUE  ANC  THE  N  CFOOSE  K  VECTOR  TO  T 
C  61  NT  ST. 

REAL  PSTAR, ALPhAT, NKV(21)  ,PQV(21 
INTEGER  N,T,HVP,  I  ,TP1,NP1 
CALL  NCHLZKiN  *NKV) 

NP1=  N+l 

CALL  BINFMFI  PSTAR  ,N,  NKV  ,N,PQV, PM 
TP  1=  T+l 
A  L  PH  1  =0 
ALPH  2  =0 

IF  (HYP.  EC. 31  GO  TO  20 
DO  15  I  =TP 1»NP1 

ALPH1=ALPHI+ PMF ( I ) 

15  CONTINUE 

A  LPH=A  LFHl 

20  IF  (HYP.  EC.  2)  GO  TO  30 
DO  25  1  =  1,  TP1 

ALFH2  *ALPH2+  PMF ( I ) 

25  CGNTINLE 

A  LPH=ALPH2 

30  IF  (H  YP.  Nt.l  J  CO  TC  99 

1  F(ALPF.GT.ALPHl)  ALPH=ALPH1 
A  LPH  =  A  L  FH*  2 .0 
99  WRITE(9, lCli  ALPH 

101  FORMATdX.  •  VALUE  OF  ALPHA  HAT: 

ALPH  AT=A  LrH 
RETURN 

END  __  __  __ 

SUBROUTINE  C I  EINO  (N,  T  ,ALPHYp7nk  V 
C  THIS  USES  A  BISECTION  SEARCH  TC 

C  LOWER  LIMITS  (CILL.CIULJ  TO  THE  PROB 
C  GIVEN  THE  NUMBER  CF  SUCCESSES  T  AND 
C  NKV  FROM  NCHLZK. 

C  THIS  USES  SYMMETRY  CF  BINOMIAL,  E. 
C  8  SUCCESSES  OF  10  WILL  HAVE  THE  LIMI 

C  1.0  -  LIMITS  CF  A  OIST  WITH  2  SUCC  0 

INTEGER  N,T,TT,I  ,J,TTM1»LC0UNT 
REAL  ALPhYP,NKV(  21),  CILL,CIUL,ML 
UALPH=ALPhYP*C,5 
LALP  H*1.0-UALFH 
NDIV2=IFIX(FLCAT (N+l )*0.5) 

TT=T 

IF  (T  .GT.NCIV2  )  TT=N-T 
TTM1 =TT-  1 

MLEP  “FLOAT  (TT  J/FLOAT  ( N) 

PLL=0.0 
PU  L=  0.0 

IF  (TTM1.GE.0I  PLL=B  ISEX(N,TTM1, 
IF  (TT.LE.N)  FUL  *  El  S  EX  (  N  ,  TT  ,  NKV  , 
IFIT.LE.NCIV2)  GO  TO  10 
CILL=1.0-PLL 
C IUL=1.C-PIL 
GO  TO  20 
10  CILL=PLL 


PROS:  ' ,  F  8.3 »  '  P  C  VECTOR:') 


ALPHAT, NKV) 

,  ANC  RETURNS  THAT 
HE  MAIN  SUBROUTINE 

)  »PMF  ( 21  )  »CUMPR 


F,CUMPR) 


•  ,F10.3) 


,CILL  ,CI  UL) 

FIND  THE  UPPER  AND 
ABILITY  CF  SUCCESS 
TOTAL  SAMPLE  SIZE  N. 

G.,  A  DIST  WITH 
TS  OF  PROBABI  LIT IES 
F  10  (UP  P  ER<=>LQWZR) 

EP, L  ALPH , UALP  h 


NKV , LALP  H  ) 
UALPH  ) 


178 


LIMIT*  ,F10.3, 


Cl  UL  =  PUL 

IF  (  CILL.LT. 0.0)  CILL=0.0 
IF  <  CIUL.GT.1.0)  CIUL*1.0 
20  WRITE  (9, 10H  Cl L  L ,C IUL 
101  FORMATdX,'  PERCENT  LOWER 

*  •  UPPER  LIMIT, F10. 31 
RETURN 

END 

FUNCTION  eisix(N.T,NKV,  ALPH) 

C  THIS  DOES  A  RAY  SEARCH  TO  FIND  THE  EXACT  P  FCR  WHICH 
C  THERE  IS  A  l.C-ALFhA/2  OR  ALPHA/2  PROB  OF  T  SUCCESSES  IN 
C  N  TRIALS.  NKV  IS  VECTCR  CF  N  CHOOSE  K,K  =  0,...,N 

REAL  NKV  (211  ,  ALPH.LQ  , MED, HI  ,  EPSI  LNyCIMPR  ,PQV(  21)  , 
*GAP,P,EPSM,PHF(  21) 

INTEGER  N  » T »  L COUNT  ,L  L IM 
EPSI LN*0  .C32 
EPSM1*1.  O-EPSILN 
LC  0UNT*0 
LL  IM  =64 

IF(ALPH.LT.O.S)  GC  TC  10 
L  0=0 . 0 

HI=FLCAT(T  J/FLOAT  (N) 

GC  TO  2C 

10  LO=FLOAT (T  J/FLOAT (N) 

FI  =1  .0 

20  PED=0.5*(LC+HI) 

WRIT  E(9 , 2G1 )  LC,  MED,  H  I,  L COUNT 

IF  (  (MED.Lc.EFSI  LNI.  CR.  (MED.  Gc.EPSMl  )  )  GC  TO  95 
201  FORMATdX,*  LC,ME0,HI,  LOOP  COUN  T  * ,  3F  10.  5  ,1  5  ) 
LCCUNT=LCCUNTn 

CALL  BINPMF(MED,N,NKV,T »PQV ,PMF , CUMPR) 

GAP=CUMPR-ALPF 

IF  ( (  ABS  (  GAP )  •  LT.  EPSI  LN)  .CR.  (  LCOUNT.GT .LL  IM)  )  GO  TO  95 
IF(GAP.LT.O.O)  GO  TO  30 
LG-MEC 
GC  TO  2C 
HI =M  EO 
GO  TC  20 

WRITE  (  9 , 1  Cl )  ALPH ,MEO 

FORMATdX,*  ALPHA;  * ,  F10.3,*  Cl  L IMI T :  •  ,F  10.  3 ) 
eiSEX*MEC 
RETURN 
END 

C-/-/ / 
SUBROUTINE  QUANT  L  (NOF.RCCT,  R  P,I  P  ,ALPHAT,  C  ILL,  CUL, 

*  LIST.LPTR.T  1EV,  TYPCEL  ) 

C  THIS  FINDS  THE  PRGBA  EILITY  THAT  A  SPECIFIED  X  ,XSTAR 

C  IS  THE  SPECIFIcO  CUANTILE  POINT  QSTAR  IN  A  DISTRIBUTION 
C  REPRESENTED  eY  THE  VALS  GF  SAMPLE.  THE  CONFIDENCE 
C  INTERVALS  (WHICH  ENCLOSE  POPULATION  QSTAR)  ARE  GIVEN 

INTEGER  NOF.RCCT  (  50)  , IP ( 50) , LPTR ( 17 )  , TIE  V  (2 )  .  T1 , T2 , 

*  T2P2,TlPl,h,NPl,RHAT,SHAT,KEY  (400  >  .TYPCEL  ,  HYP 
REAL  RP  (  50  » A  LPHAT  »CILL»CIUL  ,L1ST(40  0)  ,NKV(  21  ) ,  PSTAR  , 

*  PMF  (21  )  ,ALPHi,ALPH2  ,  ALFMIN,  ZHAT  1  ,ZHAT2,  PGV  l  2 1)  , 

*  CUMPR.ZSTAR.NP.CCNFCC,  LTAIL  .UTAIL,  ZRHAT  ,  ZS HAT , ALPHY P 
WRITE(NOF,523  ) 

523  F0RMAT(1CX,'**QUANTILE  TEST**') 

Tl=RCCTd)-TIEV(  1  ) 

T2=R  CCT (  1 )  +T  I  EV( 2) 

HYP=IP<1  ) 

PSTA  R=RP  (  1 ) 

ALPH YP=  1  .Q-RP  (2) 

HALF  A *0 . 5*ALPFYP 
N=RCCT(  2  1  +  SCCT ( 1 ) 

WRIT  E(NOF  ,334  )  T1.T2 ,N 

334  FORMATdX,'  NUMBER  OF  DATA  ELEMENTS  LESS  THAN  X**,I5/ 


30 

95 

101 


"  V.  - 


179 


-  .N 


102 

103 


10 


15 

104 

30 


108 

40 


174 


101 


95 

107 


*1X,'  NUMBER  OF  ELEMENTS  LESS  THAN  OR  EQUAL 
*I5/1X»  *  TCTAL  NUMBER  OF  ELEMENTS ', I  5  1 
NP1*  N+l 

IF  ( N.GT.2C1  GO  TO  30 


TO  X*  • 


CALL  NCHUZ  K(Ni  NKV  ) 

CALL  BINPMF(P$TAR  ,N, NKV,N ,PQV , PMF , CUMPR) 
CALL  CI20Q  (ALPHYP, N.PMF, RHAT  , SHAT, CONFCOl 
WRITE  (  9 , 1 C2  )  71.  T2.RHAT,  SHAT 


*  PMF  :•  I 


FORM  AT  (IX,  *  T1.T2  ,  RHAT, SHAT*  ,415. 

WRITE  (9 , 103)  (PMF (II  ,I*1,NP1I 
FCRMATUX,  10F6.3I 
T  2P2  =  T2*2 
T 1P1*T 1*1 
ALPHA1*0.0 
ALPHA2*3.0 
00  10  1*1 »  TIP  1 

ALPHA  1*ALPHA 1+PMF (I 1 
CONTINLE 
OC  15  1*T2  F2»  NP1 

ALPHA2*ALFHA2*PMF(I I 
CONTINUE 

WRITE(S,104)  ALPHA1,  ALPHA2 
FORMATdX,*  ALPHA  1 ,  ALPHA2:  •  ,  2F7.  3 1 
GO  TO  40 

STDEV*(  FLOAT (  M*PSTAR*(  1.0-PSTAR  )  1**0. 5 
NP*FLGAT(N  )*PSTAR 
ZHAT1*(T1-NP)/$TDEV 
CALL  NCRMV  (1.  ZHAT  1  ,ALPH1 1 
ZHAT2*(T2-NP)  /STDEV 
CALL  NCRMV  (1,  ZhAT2,ALPH2  I 
ALPH2*1.0-ALPH2 
CALL  NCRMV  (2.Z  STAR, HALFA) 
RHAT*l*(IFIXtNP+ZSTAR*ST  OEV-E  PS  ILN  1 1 
SHAT*l*(IFIX(NP-ZSTAR*STDEV-EPSILNJi 
ZRHAT *(RHAT-NP  l/STOEV 
CALL  NCRMV  <1, ZRHAT.LTAIL I 
Z  SHAT*  (  SHA  T-NP 1  /STuEV 
CALL  NCRMV  <1,  ZSHAT , UTAIL) 
UTAIL*1.0-ITAIL 
CONFCOl.O-LT  AIL- UTAIL 
W  RITE  (9* 10£ )  C CNF CO, LTAIL ,UTA IL 
FORMAT (IX,*  CONFCO, LTAIL ,UTAI  L‘  ,3F7.3) 

ALFM IN*  ALPHA1 

IF  (  ALPHA1.GT.ALPHA2  i  ALFMI  N*AL  P  HA2 
WRITE  (9,17  41  ALPHA1,  ALPHA2  »AL  FMI N 


IF 

IF 

IF 

DO 


■  ALPHA1,  McrnMi, 
AL  PHAT*2. 0*ALF  MIN 
AL  PHAT  *ALPH2 
AL  PHAT  *ALPH1 


ALPHA2,  A LFM IN : * , 2F7 • 3 ) 


I 


FGPMATUX, 

(  HYP. EC. 11 
( HYP. EC. 2) 

(HYP. EC. 3) 

5  1*1, N 
KEY (  II* 

CONTINUE 

IF  ( TYPCEL.EQ.51  GO  TO  95 
CALL  SHSCFT  (LIST  ,KE  Y  ,N  1 
WRITE  (9.1C3J  (L IST(  I), 1*1 
CILL*LIST (RHAT I 
IF (SHAT . GT  .N  1  SHAT=N 
Cl UL *LI S  7 ( SHA  1 ) 

WRITE  (9,1011  ALPHAT  .CILL.CIUL 

WRITE  (NCF  ,10  DAL  PHAT, LI  ST  (RHAT)  , LIST (SHAT  I ,RHAT , SHAT 
. .  HAT, LOWER  AND  JPPER  “  ‘‘ 


.N  * 


PMI  ,  _  _ 

LIMITS  15, S  INOEX  OF 


FORMAT(  IX,  'ALPHA 

*  *  I  NO  E  X  OF  LOWER 
GO  TC  99 
WRITE  (9,1071  ALPHAT  , RHAT, SHAT 
WRITE  ( NCF , 10  7  I  ALPHAT,  RHAT  ,  SHAT 
FORMAT(  IX, 'ALPHA  HAT',F7.3, 

*  '  INCICES  OF  LOWER  AND  UPPER  BOUNDS' 


BOUNDS' 

UPPER 


,3 F9.3/1 
LIMIT',1 


5 1 


tc 


I  5  1 


180 


99 


109 


WRITE  (9.139)  C0NFC0 
WRITE  <NCF,109>  CCNFCC 


C-- 


FORMAT( IX.' INTERVAL  CONFIDENCE  COEFFICIENT: •  ,F10. 3) 

RETURN 

END 


C 

C 

c 

c 

c 

c 


10 


15 


20 


SUBROUTINE  CI20Q  ( AL PHYP.N.PMF , RHAT , SHAT , CONF CO) 

THIS  SUBROUTINE  FINOS  INDICEa  RHAT  ANO  SHAT  THAT  WILL 
INCLUDE  WILL  GIVE  A  CONFIDENCE  COEFFICIENT  OF  AT  LEAST 
1.0-ALPHA,  USED  FCR  QUANTILE  TEST. 

RHAT  IS  INOEX  CF  LOWER  LIMIT,  SHAT  IS  INCEX  OF  UPPER 
BOUNC.  SINCE  INDICES  OF  PMF  ARE  1 
TO  N  +  l ,  S=SHAT,  R *RH AT 

INTEGER  RHAT  ,  SHAT  ,RM AX,  SMIN»N»NP1»I  ,IM1,J,JP1 
REAL  PMF  (21)  ,  CDF  (23)  ,  RCDF  (23  )  ,AL  PHYP  ,  HAL  FA,  BEST,  CRIT  , 
*  CRIT1.CRIT2  ,CQNFCQ 
NP 1=  N+l 

HALFA=0.5*ALPFYP 
1=1 

CCF(  1)  =  PMF  (1) 

CONTINUE 
I  Ml*  I 

IF  (CCF(I)  .GE.HALFA1  GO  TO  15 
I  *1+1 

COF(  I  )  =CDF  ( IM 1 ) +P  MF(  I ) 

WRIT  E(9, 191)  I ,  PM  F  ( I  ) ,  CDF  ( I ) 

FORMAT (IX,  *I,PMF(  IM1 )  ,CDF(I)*  ,I5,2F10.4> 

IF  ( CDF  <  D.L6.HALFAI  GO  TO  10 
RHAT  * IM1 
J=NP 1+1 
RCDF  ( J)*0 
CONTINUE 
JP1*J 
J*J-1 

RCDF  (  J )  *RCCF(  J  FI  I  +PMF(J) 

W  RIT  E  (  S  ,  19  2 )  J,RCDF{  J),PMF(J) 

FORMAT  (IX,  *J,RCDF  (J).PMF(U)'  ,  I5,2F10.4) 

IF  (RCDF(J).LE.HALFA)  GO  TO  20 
SHAT  *J 

CONF  CQ=1 .O-COF(RHAT) -RCDF(SHAT) 

WRITE  (9,101)  CDF(RHAT)  ,RCDF(SHAT)  .CCNFCC, RHAT, SHAT 


191 


192 


101 


Cl 


,/lX 


FORM  AT(1X  ,'LO  ,UP  ,  CON  FCO ,  INDI  CES , 

*  3 FI  0.4 ,215  ) 

RETURN 

END 

c-/-/ / 

SUBROUTINE  COXSTU  ( L  I  ST  ,LPTR  ,IP  ,  RP  ,NCF,  NT  EST  ) 

THIS  SUBROUTINE  DIVIDES  THE  ONE  VARIABLE  IN  TWO,  THEN 
PERFORMS  THE  SIGN  TEST  TC  TEST  THE  T RENO  AS  A  MATTER  OF 
PAIRING  THE  FIRST  HALF  DATA  ELEMENTS  WITH  THE  SECCND  HALF 
THERE  ARE  TWG  OPTIONS:  1-ONE  VARIABLE,  IN  PRESENT  ORDER, 
2=C0RRELATICN,  RANKS  OF  THE  INDEPENDENT  VARIABLE 
ORDERING  THE  CEPENDENT  VARIABLE 

INTEGER  LFTR(  171 , IP <  50)  »NCF ,  TLPT  R  (3  )  ,  NHAL  f,MEC»NHT2» 

*  HYP,  INCEPC  »DEPC  ,KEY  (  200)  ,  NACT  UL 
REAL  LIST  (400  ),RP(50  ) ,  TLI  ST  (  400  )  ,SLIST(4CC)  ,DEP(20Q)  , 

*  INDEP(  200  I 
WRIT  E (NQF ,  523  ) 

WRITE(9, 523) 

FORMATdCX,'* *C0 X  AND  STUART  TEST  FOR  TREND***) 

“  - -  “  "  TO  7 


C 

c 

c 

c 

c 

c 


523 


IF  ( IP( 2  )  • cQ.  1 )  GO 
N*LP TR( 2  )-l 
INOE  PC*  0 
CEPC=0 

IF  ( I  P (3  )  •  LT  •  I  F(4  )  ) 
IF  ( I  P (4 )  •  LT.  I  P(3  )  ) 
CO  5  1*1  ,N 


I NCE  PC=N 
D  EPC  =  N 


*  '  \  \ 


c-/-/ 


KEY(  I  )*I 

I  NOE  P (  1 ) *L 1ST (  I+I NDE  PC  I 
0 E P (  I  )*HST(I  +  OEPC) 

CO  NT  INUE 

CALL  SHSCP7(INDEP,KEY  ,NI 
CO  6  I*  1 » N 

SLIST  ( I)*DEP(KEY(  II) 

WRITE(S,91£)  I  ,KEY(I  )  ,SLIST(I) 

FORM  AT  (IX,  'IN  COX  STU:  I  ,KE  Y,SL  I  ST :  *,215, FI  0.3 1 
CONTINUE 
GO  TO  9 
CO  8  1*1 ,N 

SLISTI  1>*L1ST(  II 
CONTINUE 

NHALF«IFIX(0.S*(LPTR(2l-li) 

NHT2=2*NHALF 
TLPT  R (1 )  *1 
TLPT  R (2  ) *H-NHALF 
TLPTR(3)  *1+NHT2 
NACTUL*LPTfi(2  )-l 
HYP* IP(  1  I 
CO  10  1*1,  NHA  LF 

TLIST(II*SLIST(U 

CONTINUE 

IF  (NACTUl.GT.NHT2)  GO  TO  30 
DO  2  0  I*NHALF  .NACTUL 
TL  1ST  (  I  )*SL  1ST  ( I ) 

CONTINUE 
GO  TC  50 
LL*2 +NHALF 
DO  40  I *LL , NACTUL 
I  M 1*  I  - 1 

TLIST(IM1I*SLIST<  II 
CONTINUE 

IF  (H  YP.  EG.  II  WRITE  (NQF.lllI 

FORM  ATI  1 X  » *  NLLL  HYPOTHESIS:  THERE  IS  NO  TRENC  ') 

IF  (HYP.  EC.  21  WRITE  (NOF, 112) 

FORM  AT(  1 X  ,  'NULL  HYPOTHESIS:  THERE  IS  NO  UPWARD  TREND') 
IF  (HYP. EC. 31  WRITE  (N0F,113I 
FORM  AT( IX, 

*  'NULL  hYPOTHESIS:  THERE  IS  NO  DOWNWARD  TRENC' I 
CALL  SGNT(TLIST,TLPTR,N0F,NHT2,RP.  IP.NTESTl 
RETURN 

END 

- / 
SUBROUTINE  SGNT(  L I  ST  ,LPTR  ,NOF  ,T  OTELM  ,  RP ,  I  P,  NT  EST  ) 

REAL  A(20CI  ,B  (200  ),RP(50),LI  ST(  400)  ,  P  ,EP  S  ILN,  ADI  F  ,DI  F  , 
*NKV(  211  ,  A  L  FHA  T ,  A  L  PHY  P,CIUL,CILL,  ALPHA!, ALPHA2 
INTEGER  LPTR (  17)  ,I,J ,NELMV,I E,K, M.NPLUS, NMINUS,NTOT, 

♦  HYP,  IP(  50 1  ,NT  EST  ,  NOF 

NELMV  IS  THE  ^MBER  CF  ELEMENTS  IN  VECTORS  A  AND  B 

NPLU  S*0 

NMINUS*0 

IF  (NTEST.EQ.5)  WRIT  E  (NOF  ,523) 

FORM  AT ( 1  OX  ,  '♦’♦SIGN  TEST**' ) 

NELM  V*LPTP  (21  -1 
EPSI  LN*  0.  CCO 1 
HY  P*  I P  ( 1  ) 


IF  (HYP. EC. II  WRITE  (NOF, ill) 

IF  (HYP. EC. 1)  WRITE  (9,111) 

FORM  AT( 1 X.  '  HC:  P (♦} *P( - ) '/ IX, '  Hi: 
IF  (HYP. EC. 21  WRITE  (NOF, 112) 

FORM  ATI 1 X » '  HC:  P  (  ♦)  <*P  ( -  )  •  /  IX,  •  HI: 
IF  (HYP. EC. 3)  WRITE  (NOF, 1131 
FORMATdX,'  HC:  P  (♦)  >*P  ( —  I  •  ✓  IX, '  HI 
ALPHYP-1  .C-RP  (1) 

DO  14  1*1, NELMV 


P(  +  IOP(-l'l 
P(+  )>P(-I*  ) 
P  <  +  )  <P  ( -  I  •  > 


AC* 


OIF 


kPTeHiIft 


917 


14 


987 
9  86 


C 

C 

r 

c 

c 

c 


102 


ir  inrr.cwtJi  al rnAi  “dirni 
WRIT  E(N0F  , 102  )  NELMV  .NTOT  ,NPLU: 
FORMAT (IX#*  NUMBER  OF  PAIRS:  *  t. 
•PAIRED  VALUES  NOT  TIED', 15,/* 


104 

105 
99 

c-/-/- 


523 


10 


30 

32 

35 


103 

40 


101 


L  1ST t  I  J-LIST!  J) 

WRITE  <9,91  7)  I,  J.  LIST  (I) .  LIST  (J) ,  0  IF 
FORMAT  (IX  »  'IN  SIGN  :I  ,  J  ,L(  I-J )  »DI  F :  •  ,2  1 5, 3 FiO.3) 
AOIF=AES( 0  IF) 

I F (AD  IF  .LE  .EPS ILN )  GO  TO  14 
IF  (OIF.LT.O)  NPLUS*  NPLUS+1 
IF  (CIF.GT.O)  NMINUS*NMINUS+1 
CONTINUE 

WRIT  E (9  »  $€  7 )  NPLUS, NMINUS 
WRIT  E(NQF  1 986  )  NPLUS.NMINUS 
FORMAT!  1 X  » • SIGN  TEST  *21  51 

FORM  AT!  IX. '  NUMBER  OF  !♦):',  15,'  NUMBER  OF  (-):', I: 
NTOT=NPLlS+NMINUS 

IF  (NT0T.Lc.2CI  CALL  NCHUZK  (  NTOT  ,NKV  ) 

IF  (NT0T.LE.2C)  CALL  BINH20 

*  ( NTCT.NPLUS  ,0.5  » HYP  » ALP  HAT  ,  NKV  I 
IF  (NT0T.GT.2C)  CALL 

♦  El  NORM!  NTCT,N  PLUS  ,0.  5  ,HY P,AL  PHAT  .ALPHYP ,  C ILL  ,  CIUL) 
ALPHA2*1.C-ALFHA1 
ALFM  IN* ALFPA1 

IF  (  ALPHA l.GT  .ALPHA2  )  ALFMI N*ALP HA2 
IF  4  HYP.  EC.  1)  ALPHAT-2.  0*ALFMIN 
IF  ( HYP. EC. 2  I  AL  PHAT *ALPH2 
IF  (HYP.  EC. 3)  AL  PHAT  *ALPH1 

NPLUS, ALPHAT, ALPHYP 
15,'  N  CF  ', 

.  .  NUMBER  OF  PLUSSES',15 

/•  ALPHA  PAT :  ' ,F  10.4/ '  HYPOTHESISED  ALPHA  *,F10.4) 

IF  ( ALPHAT. LT.  ALPHYP)  WRITE!  NCF,  1  04) 

IF ! ALPHAT. GE.  ALPHYP)  WRITE(N0F,1  05) 

FORMAT  ( IX » '  REJECT  THE  NULL  HYPOTHESIS  *) 

FORMAT (IX,*  DC  NOT  REJECT  THE  NULL  HYPOTHESIS') 
RETURN 
END 

SUBROUTINE  MANWHT  (LP  TR,  LI  ST  ,NOF  .  RP ) 

INTEGER  N1.N2.NT  ,NHOLD,  I.J.LPTR!  17),  IER 
REAL  A!  400  .LIST  (400),  RP(1)  ,U,Z , R (400 ) ,ALPH f ALPHAT 
WR IT  E(NQ  F  ,523  ) 

FORMAT!  1  OX, •♦♦MANN  WHITNEY  TEST*^' ) 

N1*LPTR! 2 ) -LP  TR!  1) 

N2*L PTR ( 3  )-LP TR! 2 ) 

NT*N1+N2 

IF(N1.LE.N2)GC  TO  30 
DG  10  1*1,  NT 

J*LPTF(3 I -I 
A  (  I  )*L  1ST!  J ) 

CONTINUE 
NH0L0*N2 
N  2*N  1 
Nl-NHCLC 
GO  TO  25 
CO  32  I *1 »NT 
A  ( I)  *LiST (  I) 

CONTINUE 
CALL  UTEST  ( A  ,R, 

IF(IER.LT.l)  GO 


NI,N  2 ,U , Z , I ER) 
TC  40 


WRITE  (N0F.I03) 

FORMAT  (IX,1  ALL  VALUES  OF  ONE  SUBGROUP  ARE  TIED') 
IF  (N2.LT.20)  GO  TO  95 
CALL  N0RMV(1,Z,  ALPHAT) 

WRITE  (NCF, 101)  U,Z, ALPHAT 

FORMAT!  IX  ,  •  TEST  STATISTIC  U:'»F10.4, 

•STANOAFCIZEO  Z  :*,F10.4/1X, 

•  ALPHA  HAT:*  ,F10.4) 

ALPH*1.0-RP(  1  ) 
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IFULPHAT.LT.  ALPH) 
IFULPHAT.GE.ALPH) 

104  FORMAT ( 1  X  » •  REJECT 

105  FORM  AT (  IX,'  DC  NOT 
GC  TC  99 

95  WRITE(401 ) 

401  FORM  AT(  1  X 


W  RIT E( NOF  *  104 ) 

W  R  ITE  (NOF  *105  ) 

THE  NULL  HYPOTHES IS  • ) 
REJECT  THE  NULL  HYPOTHESIS*) 


»  U.M.N2 

TEST  STATISTIC 


UJ*  ,F  10.3/1X, 
N2=*,  14) 


*  *  WITH  Nl*'  ,14,'  ,  AND 
99  RETURN 
END 
C- 

SUBROUTINE  MCNEMR ( RC  CT,  RP  ,1  P  ,NQF  ) 

C  THIS  SUBROUTINE  FIGURES  THE  VALUES  OF  THE  MCNEMAR  TEST 
C  IF  N<21,  IT  USES  1HE  EXACT  VALUES  OF  THE  EINCNI AL ,  ELSE 
C  USES  THE  LINEAR  INTERPOLATION  OF  CHI-SUUARE  DF*1 
_  (T2.  NKVI21) 


523 


103 


30 


104 


INTEGER  RCCT(4),IP(50),B,C,N,‘ 

REAL  RPI  50  .ALPHAT.ALPHYP.fi 
WRITE  ( NCF ,522 ) 

FORMAT(  1CX  ,  ***MCNEMAR  TEST**  * ) 

E*RC  CT(  2  ) 

C=RC  CT  ( 3  ) 

ALPH  YP*1  •  C-RP  (1) 

T2*0 

N*B+C 

IF  (  N.GT.20)  GO  TC  3C 

CALL  B  INH20IN.B,  0.5,  l.ALPHAT  ,NKV  ) 
ALPHAT*C.  5*ALPHAT 
WRITE  (NOF, 103)  B  .ALPHYP  ,  ALPH  AT 
F  ORMAT  ( IX ,  •  T2  15, 

■  •  HYPOTHESISED  ALPHA, ALPHA  HAT  *,2F7.3) 

GO  TO  40 

Tl*(  FLOAT  ( ( B-C  )**2 ) )  /FLCAT (  B+C) 

CALL  CHICNE  (U,  ALPH  AT) 


WRITE  (NOF, 104)  T 1  .ALPHYP  , ALP hAT 


.  .  Tit  .  _ 

F ORMAT ( IX » '  Tl  ',F9.3. 

•  HYPOTHESISED  ALPHA, ALPHA  HAT 


• , 2F7. 3  ) 


(NOF.lOi) 

HYPOTHESIS' 


) 


102) 

HYPOTHESIS') 


40  IF  ( ALPHAT.LE. ALPHYP )  WRITE 
131  FORMATdX,'  REJECT  THE  NULL 

IF  (  ALPHAT.GT. ALPHYP  )  WRITE  (NOF,. 

102  FORMATdX,*  DC  NOT  REJECT  THE  NULL 
WRITE  (NCF, 111)  (RCCT(I)  ,1  =  1  ,4) 

111  FORMATdX,'  MCNEMAR  T  ABLE :'/ 25X,  'AFTER: '✓  IX.  •  BEF  ORE 

*  12X,  'SUCCESS  FAILURE*  /IX,  •  SUCCESS*  ,3X,2ilO/  IX, 

♦  •FAILURE', 2X,  2110) 

RETURN 

END 

subroutinI  CHI  ON  I  (x.alphatT 

C  THIS  SUBROUTINE  FINDS  AN  APPROXIMATION  FOR 

C  CHI-SQUARE  0F*1  BY  LINEAR  I NTERPOLA  T  ION 
REAL  CHI  (8),ALPHAT,P (8) 

INTEGER  N 
CHI(  1)*0 
CHI( 2  )*1  .223 
CHI(  3)*2. 706 
CHI(  4>*3.£41 
CHI( £)«5.C24 
CHI(  6>*6.  £25 


8lif 


7 )*7 • £  79 
)*1 C.  £3 


:hi(  8 

P(l) *0 
P( 2)  *0.75 
P  ( 3)  *0.9 
F <  4 )  *0.95 
P (  51  *0.  975 
P(6)  *0.  99 
P(7)  *0.995 
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CD  CD 


I 


I 


10 


IF 

IF 


•AN 0 •  (  I.LT.8)) 
20 


GO  TO  10 


20 


95 


C-/W- 


C 

c 

c 

c 

c 

c 


TH 
IP  1 1 
IP ( 2 
FOR 
BLOC 
01 FF 


P (  8 )  *0.999 
CONTINUE 
1*1+1 

(  (X.GT.CHI  (I)  ) 

(  I.LT.8)  GC  TO 
P  HAT  =  C • 999 
GC  TO  95 
IM1*  1-1 

RATI  0*(  X-CHI (  1M1  )  )/(CHI  {  II-CHI!  IM1)  1 
PH  AT  *P(  IM1)+RATIQ*(P  ( I)-P(IMl) ) 

ALPH  AT*1  *C-PH  AT 

RETURN 

ENO 

/ 

SUBROUTINE  FRIEOM  (N  TREAT  ,LPTR,  L  1ST,  IP,  R  P  ,NOF  ,TOTELM  » 
IS  OOES  EITHER  THE  FRIEDMAN  AN 0  DUR8IN  TESTS? 

)  *0,  FRIECMAN,  *1,  CURB  IN;  IF  DURBIN,  R  P  (2 )  =FI  LLER, 
)=K*#  TREATS  PER  eLOCK;  IP(3)=R*#  BLOCKS  PER  TREAT 
BOTH  ,  RP  { 1  )=CCNF  COEF.  RANQUE  RANKS  THE  VALUES 
K  BY  BLOCK;  PARCCM  COMPARES  THE  SUMS  OF  RANKS  OF 
ERENT  TREATMENTS  TO  TEST  FOR  PAIRWISE  DIFFERENCE 
INTEGER  NCF,N1REAT,LPTR(  17)  ,K,B,  I  ,J,  RI,TOTELM  , 
DFIfOFItKEYUCOiiNBLCCKt  INTEST(40Q)  ,T,R,  IP(50  I 

)  ,  B2 » 


A2»R  J 


524 


5 

625 


REAL  LIST(400  > ,T 2 , SV ALS ( 400)  ,RK(4QQJ 
EPSI  LN,  8K25 » BNl *  DENQ M »NUM ,MP 1 *SR  JS, NUMVAR  ,DEN VAR , 
STDE  V,SRJ!17l  iTtRMl,  TERM2,RP<50)  ,Q(17) 

CO  3  1*1  tTOTELM 
INTEST (I  )*1 
CONTINUE 
MPl*-999999.9 
IF  (  IP<  ll.EC.1)  GC  TO  5 
WRITEINQF , 524  I 

FORMAT!  1CX,  '**FRIEDMAN  TEST***) 

GO  TC  8 

WRIT  E(NQF ,625 ) 

FORMAT(10X,***DUR8IN  TEST***) 

DO  6  1*1 ,TCTEIM 

IF  <  L  1ST  ( I  UGT.MP1)  MP1=LISTCI) 

CONTINUE 
MP1*  MP1  + 1«  C 
P*IP  (3) 

*IP  (2) 


7 

919 
16 


DO  7  I*  1 ,  TCTE  LM 

IF  (  LIST!  I  I.NE.RP  (2)»  GO  TO  7 
L  I S  T  4  f  I  =M  P 1 
INVEST  411  *0 


315 


CUNT  INUE 
WRIT  E!9  »  SIS)  PP! 2  I ,M  PI 

FORM  AT!  1 X  * '  INTEST  VALUES,  DURBIN;  RP!  2)  ,MPi  •  ,2F10.3  ) 
WRIT  E!9, 818)  (INTEST  (I)  ,1*1,  TOTE  LM) 

FORMAT!  IX,  121  5) 

EPSILN-O.COl 
NBL0CK*LPTS!2  1-1 
CALL  RANQUE 

'  (T  QTELM , LIST ,SVALS , RK,Q ,KE Y,NTREAT , NBLOCK , 2 ) 

WRIT  E(9 , 3 15 )  !RK( I), 1*1 ,TOTELM) 

FORMAT!  1X,6F1C.3) 

SR JS *0,0 
A2*0. 0 
E*LPTR<  2  )-l 
T*NTREAT 
CO  10  J*  1  ,T 
R  J*0 

DC  15  1*1, E 

RI*I+LPTR(J) -1 
IF(INTEST(RI  I.EQ.O)  GO  TO  15 
RJ*RJ+RK ! RI) 
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A<-. 


312 

15 


10 


316 


112 


50 


99 


A2aA2*(RK (RI  )**2I 
WRITE  (9,312  I  1 ,J,RI  ,RJ, Ac 
FORMAT (IX  , 1  I  J  ,R  I'  *3  15, '  R  J  »A2  '  ,2F  10.3) 
C  ONT  IMS 
S RJ(  J  )*RJ 
SRJS»SPJS+  1RJ*R  J) 

CONT  INUE 

IF  (  IP(1  UcC.l)  GC  TO  50 
K*T 

E2*S RJS/ ( FLOAT (B ) ) 

CENOM»A2-e2 

WRIT  E  (  9 »  2 1  fc  )  A2, 82,0  ENOM 
WRIT  E  (NO  F  ,316  )  A2,B2,DEN0M 
FORM  AT(  IX  , 'A2  ,B2  ,OENOM*  ,3F10.3i 
IF  (  DENOM. IE.  EPS  I  LN)  GO  TO  99 
BM1=  FLOAT  (g-1  I 

8K25*0.2  5*(FLCAT(E*K*((K+l)^*2n  I 
N('M=  BM1*  (  E2-BK25  ) 

T2»NUM/DENCM 

0F1=K-1 

CF2=  (B-l  >  *  CK-  1 1 
WRITE  ( NCF ,112 )  T2  ,0  Fi,  DF2 
WRITE  (9,1121  T2  ,  CF1 iDF  2 


* ,  F10  .3, 1  DF  1 :  * »  I  5,  '  DF2:',I5) 


101 


102 


FORMAT (IX, 'VALUE  OF  f 2s 
N0MVAR»2.C*B*  (Al-Bll 
CENV  AR*  (  K-  i )  *  ( B-  1 ) 

STOE  V*( NUPVAR/DENVAR )**Q.5 
CALL  PARCCM(SPJ,T,STOEV,NQFI 
C-0  TO  99 
CONT  INUE 
DF 1* 12*( 7-1) 

CF2*  (  P*T*  (K-l  )♦(  K  +  l)  ) 

TERM1*SR  JS*FLCAT  (DF1  )  /FLCAT  (  DF2  ) 

OF  1*  T-l 

CF2«3*R*CF1*(K+1  )/(K-l) 

TERM 2*FL CAT  ( DF2) 

T3*T  ERM1-T  ERM2 

WRIT  E(9 , 10 1 )  7ERM1 ,T  ERM2 »SRJS#T 3 
FORMAT!  1 X  ,  'TEFM1-2  »SR  JS  ,  T3'  »  4F1Q.4) 

WRIT  E (NO F ,  102  )  T3,0F1 

-  - STATISTIC  T3*  »F1G.4  ,/iX  , 


FORMAT!  1  X  ,*  TEST  STA 
♦  •  TO  BE  COMPARED  WlTh  CHISQUARED  OISTRIBUTION'/lX, 


(Til 


♦  »  WITH  DEGREES  OF  FREEDOM*' ,  15 ) 

OF  2*  B*K*GF1 

NUMV  AR*FLCAT ( (R*(K+1 l*(K-l)M 
NUMV AR*NUP  VAR*  (F  LCAT  (CF2  )-T3*FL0Al 
CF2*6*0F1*(6*K-T-B+1 ) 

CENV  AR*  F  L  C  A  T ( CF2 J 
STOE  V»(  NLP  VAR  /DEN  VAR  I  **0*5 
CALL  PARCCM(SPJ,T,STDEV,NOF) 

CONT  INUE 
RETURN 
ENO 

C-/-/-/-/- /-/-/-/-/-/-/-/-/-/ /-/-/-/-/-/-/-/- /-/- /- 
SUBROUTINE  QUAOE  (NTR EAT ,LPTR, LI  ST,  NCF,  TCTELM  ) 

THIS  PERFORMS  THE  QUAOE  TEST.  RANQUE  RANKS  THE  DATA 
ELEMENTS  BLOCK  BY  BLOCK  AND  RANKS  THE  RANGES  OF  THE 
BLOCK  ELEMENTS  (INTO  Q).  PARCOM  HAS  PAIRWISE  COMPARISON 
OF  SUMS  OF  TREATMENT  RANK|^  TO  TEST  FOR  EQUAL  EFFECT 


INTEGER  NCF,NTREAT,LPTR(17)  ,  K  ,B  ,  I ,  J  ,PI  ,K,  6,  TCTELM, 

*  0F1,DF2,KEY(400),NBL0CK 

PEAL  LIST  (TOT  ELM)  ,T2  ,SVALS<400)  ,  R  (400  I ,  B2  ,A2  ,RJ  ,NUM, 

*  EPSILN,BK251EM1,Q(200)  ,  A1 ,  B  1  ,S  (  400 )  ,  SI  J  ,  $J2  ,  SJ  ,  MEAN  , 


524 


crdium,oiN«3,cnx,wi£uvji,Ai,oi,di*tu 

*  OENCM,  OENVAR  ,NUMVAR  ,  STDEV,  SRJ(  17) 
WRIT  E(NOF ,524  ) 

FORMAT!  10X,  •♦♦QUAOE 
KaNTPEAT 


TEST**' ) 


186 


101 

15 


10 

95 

102 


£=LPTR(2  1-1 

A1*0 

B1*0 

S  J2=0 

71  *0 

CF 1*  K—  1 

CF2*DF1* ( £-1 ) 

EPSI LN=  0*  CC1 

MEAN  *Q«  5  * ( FLO  AT( (K+l )  )) 

CALL  RANCLE(TCTELM,LIST,SVALS,R,  Q»KEY  »K»  B  ,3 ) 
00  10  J*  1  »K 
SJ*0 

DO  15  1*1,  B 

RI*I  +  LP7R(  J  1-1 
SI J*Q( I  )*(  R(RI  l-MEAN) 

A1«A1+SIJ*SIJ 
SJ*SJ+SIJ 

WRITE  (9, 101 »  I,J,RI,R(RI),Q(1),SIJ,A1 


>J*S, 


FORMAT  ( 1  X  *' 

CONTINUE 
SRJ  (  J)  *S  J 
S  J2*SJ 2  +  5 
CONT  INUE 

81»S  J2/(  FLCAT(B)  ) 

CIF*  Al-Bl 

IF  (OIF.LT.EPSILN)  GO  TO 
Tl«(  FLOAT  <B-1))*B  1/0  IF 
WRITE  (9,102)  A1,B1,  T1,0F1,DF2 
WRITE  (  NCF  ,102)  A1,B1,T1  ,DF1  ,DF2 
FORMATdX.'Al ',F1C.3,  •  B1',F10.3/ 

•  TPCT  ^  T  £  T  T  5TTr.  T1  • 


I,J  ,RI *  ,313  ,*  R ,Q( I ) ,SI J,A1*  ,4F9.3) 


95 


DF  1  *  » I  5 » '  DF2'  ,15) 


C 

C 


10 

20 

101 


iriui,. 

*  IX,*  TEST  STATISTIC  Tl:*,F10.3, 

NUMV  AR* 2 • C* ( A 1-B 1  )*F  LOAT  (b) 

CENV  AR* FLOAT (  CF2  ) 

STDE V*<  NLP VAR/OENVAR )**0.5 
CALL  PARCCM(SFJ,K,STOEV,NOF) 

RETURN 

END 

SUBROUTINE  COCHRN  (NR  CW»  NCOL  ,  RCCT  ,IP,RP,NOF) 

THIS  SL6R0LTI NE  TAKES  ROw AND  COLUMN  COUNTS  RCCT 
FOR  SUCCESSES,  DERIVED  IN  CELLER,  AND  PERFORMS  THE 
COMPUTATION  OF  THE  CHI  SQUARED  STATISTIC 
INTEGER  SCJSQ  .SRISQ,  RCCT(50)  ,NSUCC,NROW,NCOL  ,NUM  , 
*NOF,  IP(  50)  ,QEf>OM,OF 
REAL  CHI  ,FF  (5C  ) 

NSUC  C*0 
SCJSQ*0 
SRISQ-0 

DO  10  J*1,NC0L 
IC*J+NRCW 

SCJSQ*SCJSC+RCCT(  10**2 
CONTINUE 
CO  2  0  1*1  ,NROW 

NSUCC*NSUCC+RCCT( 

SRISQ*SRIS  C+RCCT( 

CONT  INUE 

WRIT  E (NGF  ,101 )  SRISQ  ,  SCJSQ,  • 

FORM  AT(  1  OX  ,  ***COCHAR  AN  TEST*** 

♦  'SUM  OF  TFE  SCUARES:  R(I)'AI5, 


I  ) 

1 1**2 


I 5/1X ,  'FOR', I  5 


102 


lNSUCC  ,NCOL 
,  /  IX , 

ur  ir:  jwmrcj;  ki  u'f  i3,'i  C(J)*( 

*  •  SUCCESSES  IN',  15,  »  TREATMENTS') 

NUM*  (  NCOL*  (NCCL-1  )*SCJSQ  )-(  <  NCOL-1)  *NSUCC*NSUCC) 
DENOM«(NCCL*NSUCC)-SRISQ 

CH I* FLO AT (NUM )/F  LCAT ( OENOM) 

CF-NCOL-1 

WRIT  E(NOF  ,102  )  NUM ,0  ENOM  ,CH I  ,0F 

FORM  AT(  1  X  , '  NLMERATOR.DENOMI  NATOR:  •  ,215/lX,  *  CHI 

♦  •SQUARED  VALUE:'  ,F10  .5/ IX 'DEGREES  OF  FRE  ECOM :  ' ,  1 5  ) 


187 


RETURN 
END 

C- /-/-/-/-/-/- 

SUBROUTINE  RCCNT  (  RCC T  ,NRCW, NCOL , NCF ) 

C  THIS  SUBROUTINE  PERFORMS  THE  CHI-SQUATEC  TEST  ON  CELL 
C  COUNTS  GIVEN  E V  THE  VECTOR  RCCT  (ROW  £  COLUMN  CELL  COUNT  I 
INTEGER  RCCT (501  ,  NROW  ,NCCL,NCF,MAX,  RC  (50  I  ,CC(  50)  ,0F, 
*UL,LL 
PEAL  EIJtCH 
MAX*  NROW 

IF  (NCOL.  GT.NRCW)  MAX*NCOL 
CO  10  1*1,  MAX 
RC  ( I  ) =  C 
C  C ( I )*C 
13  CONTINUE 

WRITE (9 ,102)  NROW  .NCOL 
WR  IT  E  (NO  F ,  102  )  NRCW.NCQL 

102  FORMAT(  IX  ,*  I  N  RCCNT  :  NROW, NCOL:  *  ,2 1  5  , »  RCCT:*) 

CO  15  I  *  l.NRO  W 

LL*1+NCCL*  (1-1  J 
UL*NCCL*I 

WRITE(9,102)  (RCCT(iW,K=LL,UL) 

WRITE  (NCF,  103  )  (RCCT  (K),K*LL,UL) 

103  F  QRMAT  ( IX  ,  161  5 ) 

15  CONTINUE 

N*0 

00  20  1*1 ,NRO V 
00  22  «J*1 ,  NCOL 

K*J>(  I-i I *NC CL 
CC ( J ) 3CC l  J  )*  RCCT ( K) 

RC ( I ) *RC ( 1 )♦ RCCT (K) 

N*N*RCCT(K1 
22  CONTINUE 

20  CONTINUE 
CHI*  0.0 

CO  30  1*1  ,NRGW 
DC  32  «  *1  *  NCOL 

E  I  J*( FLOAT (C  C (J ) *RC ( I)) ) /FLOAT (N) 

K*J  +  (  1-1  MNCQL 

CFI*CFI+(  (FLCAT(RCCT(KI)-EI  J)**2)/EIJ 
W  R IT  E ( 9 . 67  6 )  I  »J»  K ,E I J  .RCCT  (  K  ) 

6  78  FCRMAT  (IX,  ' I ,  J  ,K,  E  IJ  , CIJ  :  *  ,3  I  5, 2F9 .3) 

32  CONTINLE 

30  CONTINUE 

DF*(  NR0W-1)*(NC0L-1I 
WRITE  (9 , 1C  1 )  CHI  ,  DF 
WRIT  E(NQF  ,101  I  CHI.DF 

101  FORM AT(  1  X, *  CH-SCUARED  VALU E*  , F  10.3  , 

*  '  CEGREcS  OF  FREEDOM*, 15) 

RETURN 
ENO 
c- 

SUBROUTINE  GO  FIT < LIS T ,L PTR, I P,RP ,NOF ) 

C  THIS  GROUP  CF  SUBS  FINDS  PARAMETERS  IF  NEC  E  SS  ARY  AND 
C  COMPARES  AN  ACTUAL  DISTRIBUTION  TO  A  THEORETICAL  CIST. 

C  AVAILABLE  THECRETICAL  DISTRIBUTIONS  ARE  NORMAL, UN IFORM, 

C  EXPONENTIAL.  SAMFLE  SIZE  15  OR  MORE  DESIRED. 

C  IP(1)*#  CIST;  IP(  2)*#  ESTIMATED  PARAMS  J  IP  (3)*N  OF  ERLANG 
C  RP(1  )*MU,  A,  LAMBDA;  RP  (  2)  *VAR  ,  B.  A  LPHA  ;  RP(3I  =  CCNF  COE  F 

REAL  RP(  50),  LIST  (400  » ,C  UMP(  8  0 )  ,  P  ARTV  (  80)  ,CHI  ,  MU,  VAR 
INTEGER  LPTR(17)  , N,E  J (80  )  ,ESTPAR  , NO! ST,NC ELL  , NOF , DF, 


*IP(50),RCCT(8C) 

WRIT  E (NOF , 523 ) 

F0RMAT(1X,***  GOO  ONES  S  OF  FIT 
N*LPTR(  2  )-l 
CHI*0 .0 

CALL  CPVGEN(N  ,CUMP,E  J.NCELL) 


TEST  (CHI  SCUARE)***) 


CALL  MUV  AF ( LI ;T»  N  »MU 

RP(4)=MU 

RP  (5  )*VAR 

NO  IS  T=I  P  ( 1 1 

IF  (NOIST.EC.il 

*  CALL  NCFMD  (CUMP»I 
IF  (NCIST.EQ.21 

*  CALL  UMOCCUMP.IP 
IF  (  NOIS7.EQ.2) 

*  CALL  EXPC  (CUMP,  IP,R 
IF  (NDIST.cQ.4) 

*  CALL  ERLu  (  CUMP  , IP 
IF  (NDIST.EQ.5) 

*  CALL  W  E  IC(  CUMP  » IP 
CALL  CELCN7(RCCT,LIS 
WRIT E(NOF 1 121  ) 
FORMAT!  IX,*  CELL  NR, 

*  2  7( *  -  *  11 

CO  25  I *1  ,NCE  LL 

WRITE(NCF,  122)  I, 
FORMAT ( IX , 2 ( 1 5  *  * 

I  F(I.LT.NCELL)  WR 

F  ORMAT (  IX,  2(  * - 

CONTINUE 

WRIT  £ ( NCF • 124  I  N, 
F  ORMAT  (271  )  /9X 

CO  10  I*1,NCELL 

C HI*CHI*FLCAT ( (EJ 
CONT  INUE 

CF*NCELL-IF(2  1-1 
WRIT E(9 ,1011  N.NCELL 
WRIT  E(NOF, 101  1  N.NCE 
FORMAT!  IX,' WITH  SAMP 

*  '  CELLS  * / IX , *  CHI  -SQU 
IF(IP(21  .EC. 01  WRITE 
FORMAT(  1X/1X,  •  SAMPL 
RETURN 


i  VAR  I 


P,RP,PARTV,L IST,N,NCELL,NCF1 

,RP,PARTV,LI ST, N, NCELL.NO F) 

P,PARTV, LIST ,N,NCELL ,NOF) 

»RP .PART  V, LI  ST ,N ,NCELL,NCF) 

, RP, PART  V, LI  ST, N.NCELL, NCF 1 
7,N,NCELL,PARTV) 


E  J(  J1 


0(J»  ,  PARTITION  VALUE 


EJ(  II  ,RCCT ( I  1 

I*  1  1 

ITE  (N0F.123 1  PARTV (  I  1 
- i  *  1,  F10.  31 

N 

*2181 

( Il-RCCT  (II  1  **21/FL0AT(EJ  (II  1 


, CHI  , OF 
LL, Chi* OF 

LE  LENGTH', 14,*  DIVIDED  INTO', 13, 
ARED  VALUE  OF*  ,F  1C.3/1X  ,  •  OF*'  ,15  1 
(NOF, 112  1  RP (4) , RP(5  1 
£  MEAN  AND  VARIANCE:  •  ,2F10.41 


SUBROUTINE  CPVGENIN,  CUMP  ,  EJ  ,  NCEL  L  1 
C  THIS  SUBROUTINE  FINDS  THE  EXPECTED  CELL  CCUNT  VECTOR  EJ 
C  AND  THE  THE  CUMULATIVE  PROBABILITY  VECTOR  CUMP 
REAL  LI  ST  (400  l,CUMP(  80 »  ,PARTV(80  1,ND5,FN 
INTEGER  LPTR  (  171  ,N,EJ(801  ,ES  TPAR  ,NDI  ST.NCELL  ,NOF  ,IND  5  , 
*  CUM(80  1  ,  INC  EL,  HER,  REM 
FN*F  LOAT ( N  1 
IF  (N.GT.S2I  GO  TO  ^  3 
ND5*FN/5  .0 
IN05*IF I  X (N05  1 
REM*N-(  5*  IND5  1 
NCEL  L*I  NC5 

IF  (REM.GT.21  NCELL=IND5«-1 
CO  12  I  3 1 , NCE  LL 
E  J ( I  I *5 
12  CONTINUE 

IF  ( REM. EC.O)  GO  TO  20 
I  F(REF.  LE.  2)  EJ(1  »*b 
IF(REM.GE.21  EJ<1)*4 
I  F (RE F  . EQ •  2  1  EJ(NCELL)*6 
IFIREM.EQ.2)  EJ(NCELL)=4 
20  CONTINUE 
GO  TC  29 
23  NCEL  L*10 

INCEL*IFIX(FN/10.0) 

REM*  N-l  0*  I  NCE  L 
MER*  10-RE* 

CO  24  1*1,10 


524 


I F  (  I  .LE.MER)  EJ< I )= 1NCEL 
IF  (I.GT.MERI  £J(  I  l  =  INCEL+l 
24  CONTINUE 

29  C'JM(  1 1=  E  J  ( I ) 

CUMP  (1)  =  CLM1  )/FN 
CO  30  1 3 2  »NCE LL 

I  Ml3 I - 1 

C  UM(  I  I  3 £J  (  I  l  +  CLM!  I  Ml  i 
CUMP  (  I )3FLCAT ( CUM (I)  >/FN 

30  CONTINUE 

WRITE  (9,1011  N, NCELL 

101  FORM  AT!  IX,  1  CLMP  GENERATOR  FOR  N=',I5, 

*  •  #CEL  LS  *,15,'  CLMP  VECTOR  :  ' ) 

WRIT  E (9 » 102  )  (CUMP(I  ),I31, NCELL) 

102  FORM  ATI  I X  , 8F7  .3) 

RETURN 

ENO_  _  _ 

SUBROUTINE  NO FMoIcUMP  ,1 P ,RP , PART  V , LI  ST ,N , NCELL,NCF) 

C  THIS  SUBROUTINE  FINDS  THE  PARTITIONS  VALUES  FCR  DIVIDING 
C  THE  SAMPLE  INTC  CELLS,  8ASED  ON  THE  CUMULATIVE  PRCB  VECTOR 
C  CUMP  THE  PARTITION  VECTOR  PARTV  IS  NCELL-1  LONG 

REAL  LIST  (400  ),CUMP(  80)  ,  PART  V(  30  )  ,RP  (  50 )  ,  STDE  V,MU,  VA  R 
INTEGER  N, NCELL,  IP(5Q) 

WRIT  E(NOF, 524  J  IP(2) 

FORMATdX, ‘NORMAL  DIST',13,*  PARAMETERS  ESTIMATED') 

IF  (  IP(2).EQ.C)  GO  TC  5 
MU*RP  ( 4  ) 

VAR»RP(5I 
GO  TO  10 
5  NU3RF(1) 

VAR=RP(  2  ) 

10  STDE  V=VAR**Q.  £ 

NCM13NCELL-1 

WRITE(9,111)  NU.VAR, STOEV.NCMl 
111  FORMAT!  IX, 'MU  ,VAR  ,STDEV  ,NCM1  •  ,3F  8.3,15) 

CO  20  I  3 1  ,NCM  1 

CALL  NCRMV<2,Z,CUMP( I) ) 

PARTV!  I  )3S7DEV*Z+MU 
WRITE  (9  ,12  1)  I  ,CUMP(  I  »  ,Z,  PART 
121  FORMAT  (IX,  'I,  CUMP  (II  .2, PARTV  < 

20  CONTINUE 
RETURN 

EN0_  _ 

C  SUBR  OUT  I NE  cl LCN  T  (RC  CT, LIST ,N, NCELL, PART  V  I 

C  THIS  ROUTINE  SORTS  THE  SAMPLE  AND  FIGURES  THE  CELL  COUNT 
C  OF  THE  SAMPLE  COR  FES  PCND I NG  TO  THE  EJ'S  OF  THE  THEORETICAL 
C  DISTRIBUTION 

REAL  LIST(400  ), PARTV (80 ) 

INTEGER  RCCT(EO)  , N ,NCEL L , KEY  (400  ) 

J=1 

CO  5  I«1  ,N 
KEY!  I  1*1 
5  CONTINUE 

CALL  SHSCRT(LIST  ,KEY  ,N) 

CO  6  j  jNCEL  L 


V  (1 4 

I)  •  ,  I5.3F8.3 


.10 


RCCT  (  I  )30 
CONTINUE 
NCM1  =NCE  LL-  i 
CO  20  I  3 1 » N 

IF(LIST(II.LE.PARTV(NCM1)  I  GO  TO  10 
RCCT( NCELL ) 3 RCCT (NCELL )  ♦  1 
GC  TO  20 
CONTI NLc 

IF(LIST(  I).LE.PARTV(J))  GC  TO  15 
J=  w  +  1 
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15 

20 


SUBROUTINE  UN  1 01  CUMP  ,  I P  ,RP,  P  ART  V  ,  LI  ST  ,N,  NCELL  ,NGF 1 
C  THIS  SUBROUTINE  FINDS  THE  PARTITIONS  VALUES  FOR  DIVIDING 
C  THE  SAMPLE  INTO  CELLS,  BASED  CN  THE  CUMULATIVE  PRCB  VECTOR 
C  CUMP  THE  PARTITION  VECTOR  PARTV  IS  NCELL-1  LONG 

REAL  LI S  T ( 400  )  »CUMP(  80  I  ,  P  ART  V(  80  1  ,RP  (  50 1  ,STDEV,A,8,C 
INTEGER  N, NCELL,  IP(5Q) 
l»R  IT  E  (NO  F  ,  524  J  I P  <  2  > 

524  FORM  AT(1X, 'UNIFORM  DIST,«,I3,«  PARAMETERS  ESTIMATED'! 
A=RP  (11 
E=RP (2) 

C  =  B-A 

NCM1 =NC  £  LL  -  1 
CO  20  I  *  1 ,  NCM 1 

P  ART V  (I)=CLMP(I)*C+A 
20  CONTINUE 
RETURN 
END 


SUBROUTINE  EX 
C  THIS  SUBROUTINE 
C  THE  SAMPLE  INTO  C 
C  VECTOR  CUMP  THE  P 
REAL  LI  ST  (400 
INTEGER  N,NCE 
hRITE(9 , 5241 
524  FORMAT!  IX, 'EX 
♦  •  PARAMETERS 
LAMBDA*R  P  ( 1 J 
IF  <  1P12).EQ. 
NCM1  *NCEIL-1 
CO  20  1  =  1, NCM 
PARTV  (I  »  =  ( 
2  0  CONTINUE 
RETURN 
ENO 


FD4 CUMP ,  IP, RP, PARTV , LI  ST ,N, NCE LL  ,NOF ) 
FINDS  THE  PARTITIONS  VALUES  FOR  CIVIDING 
ELLS,  BASED  ON  THE  CUMULATIVE  PRCB 
ARTITION  VECTOR  PARTV  IS  NC ELL- 1  LONG 
J  ,CUMP(  801  ,  PARTV  (80  1  ,RP  (501  ,  LAMBDA 
LL,  I P  (5  0  1 
IP(  2  1 

FGNENTI AL  OISTR IBUT ION , • ,13 , 

ESTIMATED  1 

Cl  LAMBDA*  1*0/ RP(  4  1 


ALOG(  1-CUMP(  I  1  1  1  / ( -  LAMBDA1 


n 

c 

SUBROUTINE  ERLO(CUMP  ,  IP  ,  RP ,  PART  V  ,  LI  ST  ,N,  NCELL  ,NGF  1 
THIS  FINDS  THE  X  TO  CCRRESPQNED  TO  THE  F  CF  CLMP 

S': 

C  BY 

NEWTON-RAPHSGN  SEARCH  IN  ERLNUT ,  TO  YIELD  PARTV 

REAL  RP(  5CI  ,LIST  (  400  1  ,CUMP(  8  0!  ,P  ARTV  (  801  ,CHI  ,DN,F,X 

♦  LAMBDA, FN 

INTEGER  LFTRt  171 , N,E  J (801 »ESTPAR ,NDI ST ,NC ELL , NOF , OF 

♦  IP(501,RCCT(8C1,  FACT  ,NCM1 

NCM1  =NCE  LL-1 

m 

_  . 

FN=FL0AT(IP(3  11 

LAMBDA*RP  ( 1 1 

IF(IP(1 1  .EC.l  1  L  AMBD  A=FN/ RP ( 41 

NN=I P(31 -1 

FACT  =1 

DO  5  1*1, NN 

FACT*FACT*I 

: 

5 

CONT  INUE 

FN*F  LCAT (FACT  1 

CO  10  1=1,  NCM  1 

F  =CUM  P ( 1  1 

CALL  ERLNUT(F  ,X,NN,FN, LAMBDA! 

PARTV  (  I  >  =  X 

V" 

r. 

10 

CONTINUE 

RETURN 

END 

SUBROUTINE  ERLNUT  (RfX.NN.FN, LAMBDA) 

THIS  IS  THE  NEW  TON-RAP  H  SON  TECH  WHICH  TAKE  S  AN  F  AND 
RETURNS  AN  X;  NN*N-1 i  FN*(N-1) 

REAL  F,  X,  FN.LAMBOA, XLG,  XHI,  XMED.NF, DIF, ACIF,  SLOPE, 

♦  ULX.LXN,  FACT 
INTEGER  NN.N 
NF*F  LOAT (NN  I 
EPS!  LN* 0*0001 
i*  LAMBDA 
X*(NF*1.C)/L 
IF  (F  *GT *0*71  X«X*2*F 
IF  ( F  •  LT  •  0  •  3 )  >»X*3*F 
XLO-O 

XHis100* C*X 
.0  CONTINUE 
LX«L*X 
LXN*0 
FACT-1 

00  15  0*1 » NN 

FACT* FAC T*FL CAT ( J) 

LXN*L>N+ { LX**J ) /FACT 
.5  CONTINUE 

F  X*l*  0-(  (  LAN* 1*  J*  EXP  I  -LX I  I 

DIF-F-FX 

ADIF*AES(D  IF) 

SLOPE*  ( L*(  LX**NN)  *EXP(-LX )  )/FN 
IF(DIF.GT.C)  XLO*  X 
IF(OIF.LE.C)  XHI* X 
X-X+DIF/SLCPE 

IF((X*LT*XLC)*GR*  (X.GT.XHD)  X=0.5*(XLC+XHI) 

WRITE (5,101)  F,FX  , OIF, SLOPE, XLO, X, XHI 
.01  FORMAT ( *  F,FX,OIF,SLOPE,XLO,X,XHIi  *,3F11*7/20X,4F11*  7) 
IF(AOIF.GT.EPSILN)  GO  TO  10 
RETURN 
ENO 


SUBROUTINE  WEIO(CUMP,  1P,RP,PARTV  .LIST  ,N,  NCELL  ,NOF ) 
THIS  SUBROUTINE  FINDS  THE  PARTITIONS  VALUES  FOR  CIVIDIN 
THE  SAMPLE  INTO  CELLS,  BASED  ON  THE  CUMULATIVE  PRCB 
VECTOR  CUMP  THE  PARTITION  VECTOR  PAR  TV  IS  NCELL-1  LONG 
REAL  LI  ST(40G )  ,CUMP(  80)  .PART  V(80  )  ,RP  (50)  ♦  LAMBCA, 

*  APOW.ALPF.F,  * 

INTEGER  N, NCELL. IF(50) 

WRITE(9.524)  IP(2) 

»24  FORM  AT(1X,'WE  JBULL  0  ISTR IBUT  ION  ,  ' ,  13  , 

*•  PARAMETERS  ESTIMATED'  ) 

LAM6CA*RP (2) 

AP0W*1*  O/FP  ( 1  ) 

NCM1 -NCELL-1 
CO  20  1*1  ,NCM1 
F-CUMP  ( I  ) 

X*(  (ALOG(l.C-F)  )**APOW)/( -LAMBDA) 

PARTV  (  I  MX 
tO  CONTINUE 
RETURN 


SUBROUT  INE^WI 
THIS  USES  RANQU 
THE  1ST  TREATMENT 
RANKS  OF  THE  PCS 
TO  GET  A  T  VALUE* 
AOOEO  TO  THE  1ST 
PLUS  A  CONSTANT  I 
REAL  LIS  T(  400 
*  SRP OS,  SRNEG, 


LCOXtLI ST.LPTR, IP.RP.NOF) 

E  TO  RANK  THE  ABSOLUTE  VALUES  OF 
MINUS  THE  20  TREATMENT,  THEN  SUMS  THc 
DIFFERENCES  AND  THE  NEG  DIFFERENCES 
THE  VAR  SHIFT  ALLGWS  A  CONSTANT  TO  BE 
TREATMENT,  E.G.  ,  TO  SEE  IF  THE  1ST  TREAT 
S  SIGNIFICANTLY  LARGER  THAN  THE  2D  TREAT 
),RP  (50  )  ,A0IF(200)  ,  SVALS  (40 0  ), RANK ( 400)  , 
SRANK.SRSU  ,P,PR,Z»ALPH1  , ALPH2, ALFM1  N, 


*CIF,  ALPHAT, SHIFT 

INTEGER  LFTR( 17) » KEY (400 )t IP (50 1 ,NQF,G(2GC1, 

*  NNEGtNPCS  f  NOCIF  *  POS  OIF  (200)  t  IR 
SRANK-0.0 
SR  SQ  *0*  3 
EPS!  LN*0 .CCO 1 
IR-0 
NNEG-0 
SRPO  S*0  •  0 
SRNEG-0.0 
NPOS-O 

NVALS»LPTF(3)-1 
NBL0CK»LPTP(2  1-1 
SHIFT»RP(1) 

CO  10  I  ■  1 yNBLCCK 
K-U-NBLCCK 

0IF-L1STU  )+SHIFT-LI  S T C 1 1 
ABDIF«ABS(CIF» 

IFUBOIF.LT. EPSILM  GC  TO  10 
IR-IR-H 

IF(OIF.LE.C)  GC  TC  13 
NPCS*NF0S+1 
PCSCIF (IR)*1 
I F(0I F.GE.C I  GC  TO  15 
NNEG«NNEG+1 
FCSDIF(IR)®-1 
A  OIF  (  IR)«AE$(OIF) 

CONTINUE 

NOOI  F*NBLCCK-NPOS-NNEG 

CALL  RANQUe(  1  F  ,ADIF»  SVALS,RANK,Q  .KEY  ,1  .NBLOCK  ,1 ) 
WRIT  E  (9 . 2 11 )  lAOIFd  ),I»1,IR) 

WRIT  E  (9  •  211 1  (RANMI  )  ,1-1. IR) 

FORM  ATI  IX,  'DIF/RNK:'  ,7F8.2) 

CO  20  1  *  1  .NBLCCK 


20  1  *  1  .NBLCCK 

I  F ft ?GT IS! I ) 1 S kio i*SRPOS* RANK ( I ) 


I F(P • LT.-O .11  .  _ 

PR«P*ftANK(  II 
SRANK«SRANK+PR 
SRSQ-SRSG+FR*PR 
ACIF (  I  )*A0  1F(  I  )*P 
CONTINUE 

WRIT  E(NOF  ,101 )  NBLOCK,  SHIFT,  NPCS  ,NNEG  ,  NOCIF,  SFPOS, 

*  SRNEG 

FORMAT!  10X,'**wILC0XCN  SIGNED  RANK  TEST**«/1X. 

*  'IN*, 1 4,'  PAIRS, '/IX, 'WITH  1ST  VARIABLE  SUBTRACTED' 
*•  FROM  THE  2D  PLUS  A  SHIFT  OF* ,F 9.3/ IX. •  THERE  ARE', 

*  14,'  POSITIVE  DIFFERENCES, ',14.'  NEGATIVES.  AND'/ 

*  5X  ,  14  ,  '  TIE  (S)  •  /IX  , 'SUM  OF  POSITIVE  RANKS  :*  ,F1 0.2/ 

*  IX  ,  'SUM  CF  NEGATIVE  RANKS:  ',  FI  0.2  I 
IF  (NBLOCK.LT. 20)  GO  TO  50 
2*SRANK/TSRSC«*0.5) 


SPNEG*SRNEG+RANK  (I  I 


IF  (NBLOCK.LT .20)  GO 
2«SRANK/7SRSG«*0.5) 
CALL  N0RMV(1,2.ALFH1  t 
ALt-H  2-1.C-ALPF1 
ALFMIN-ALFH1 
IF4ALPH1.GT.AIPH2 )  AL 


IF4ALPH1.GT.ALPH2 )  ALFMIN-ALPH2 
IF  (HYP.  EQ.i  )  ALPHAT*  2.0+ALFMIN 
IF4HYP.EC. 2)  ALPHAT*  ALPHl 
IF4HYP.EC.2)  ALPHAT*  ALPH2 


IFIHYP.EC. 

ALPHYP*l.C 


MH1T' 


WRITE  (NCF.102)  SRANK ,  SRSQ,  Z  ,  ALPHAT  ,  ALP 
FORMAT!  IX, 'SUN  OF  SIGNED  ranks:'  .F10.2/ 
*•  SUM  OF  SQUARED  SIGNED  RANKS:'  .F10.2/' 
*•  ALPHA  HAT:'  ,F1  0.3,  3X, 'HYPOTHESISED  AL 


ALPHYP 


IF  (HYP.  EG. 1  I  WRITE 
IF (H YP. E  G.2 )  WRITE 
IF (H  YP.  EG. 3)  WRITE 


(  N0F.131) 
( NCF.132) 
( NQF ,1331 


0.2/'  T: • , F10. 3/ 
ED  ALPHA: • ,F10.3) 
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FORMAT!  IX,  'HO  :  E 
FORM  AT( 1 X  , ' HQ  :  £ 


E  (  XI*  E£Y  I  .  HI  :  E(XK>E(YI 
£(  X  )>  *e(  Y  I »  HI:  E  (XXE(Y) 


HI:  E(X)>E(Y)*) 
( NQF  , 121 ) 

(NOF, 1221 


133  FORM  AT(1X  » 'HO  :  E(XK*£(Y)»  HI:  E(X)>E(Y)*) 

IF  (A  LPHYP.GT.  ALPHAT)  WRITE  (NOF, 121) 

IF(ALPHYP.LE.  ALPHAT)  WRITE  (NOF.122) 

121  FORMATdX,*  REJECT  THE  NULL  HYPOTHESIS') 

122  FORMATdX,  •  DC  NOT  REJECT  THE  NULL  HYPOTHESIS') 

GO  TC  95 

50  WRITEiNOF  ,103  ) 

CALL  RANQU£(NELOCK,ADIF,  SVALS, RANK, Q, KEY  ,  1,NBL0CK,1) 
WRITEINOF,  104)  (  $  VAL  S  ( I )  ,  1*1  ,N8L  CCK  ) 

103  FORMATdX, 'BECAUSE  OF  SMALL  SAMPLE  SIZE,  TABLES  '/ 

*  IX,'  SHCULD  EE  USED.  THE  DIFFERENCES  ( NCT  THE'/ 

*  IX, • ABSOLUTE  DIFFERENCES)  FOLLOW  IN  RANK  ORDER:') 

104  FORMATdX, 5FIC. 3) 

95  RETURN 

END 

C-/-/ /- 
SUBROUTINE  MECIA  (NT  REAT  ,  LI  S T,LP TR, N IP,  I  P  ,NQF  I 
C  THIS  SUBROUTINE  FINDS  THE  MEDIAN  VALUE  CF  A  NUMBER  OF 
C  VARIABLES/TREATMENTS,  FINDS  CELL  COUNT  OF  *ABGVE/BELGW 
C  6RANC  MEDIAN  IN  EACH  TREAT,  AND  PERFORMS  A  R*C  CONTINGENCY 
C  TEST  ON  THAT  CELL  COUNT  MATRIX,  USING  FORM  P.172  CONNCVER 
C  IN  ORDER  TO  ALLOW  GROUPING  CF  VARIABLES  INTO  AGGREGATE 
C  VARIABLES;  THE  AG  VAR  1  VECTOR  CONTAINS  THE  INDEX  CF  AGVAR 
C  FOR  EACH  VAR  (E.G..VAR  1  £3  IN  TO  AGVAR  1,  VAR  2,4,5  INTO 
C  AGVAR  2S  THE  AGVARI  IS  1,2,1,2,21.  THIS  INFORMATION  IS  IN 
C  CONTAINED  IN  THE  IP  VECTOR.  NAG  VAR  IS  THE  NUMBER  CF 
C  THE  NUMBER  OF  AGGREGATE  VAR  I  ABLES(  E.  G. ,  HERE,  2) 

C  IF  AGGREGATE  VARIABLES  ARE  NOT  USED , MP*NUMBER  CF  INTEGER 
C  VARI  ABLE  5*0  ,  NAGVAR*NTRE  AT*  NUMBER 
C  OF  VARIABLES  IN  OATA  SET;  AGVARI  IS  1, 2, . ..,  NTREAT 
C  NAV(I)  IS  THE  NUMBER  OF  ELEMENTS  IN  AGVAR  (I) 


NA( I )  IS  #  ELMS  ABOVE  ME 01 AN  IN  AGVAR(I); 
Q50  IS  GRAND  MED; 

INTEGER  ME01*  MED2  ,NB  ( 15  I  ,NA(  15)  ,LL,UL 


IS  1,2,  ..., NTREAT 
IN  AGVAR(I) 

AGVAR(I)  ;NB(  I)*#BELOW 


>NAV(  17) 


iniebCN  pcuj.,  rctfi  »rw  i  id  i  ,inai  idi  ,  ll,ul  ,ur  ,ina  vi  it 
*LPTR  (17)  *KEY(  400  I ,  AG  VAR  1(17  )  ,NAGVAR,  IP(17),NIP 
*A,B,  N, NTREAT,  NN,  AN,Afi 
REAL  LI  ST (400  J,V  ,NUM  ,T, CONST  ,SVAL(400)  ,35C 
A*0 
8*0 
T-0 

IF  (N1P.GT.0I  GO  TO  3 
N AGVAR*NTREAT 
DO  2  1*1, NTREAT 
AGVAR  I ( I ) « I 
NAVI  I  1*0 
CONTINLE 
GO  TO  1C 
CONTINUE 
NAGVAR*0 
00  4  IM.NTREAT 

«vG^j;|P<n 


NAVI 

CONTINLE 


(IF( 
(  I  )* 


P  ( I) 

T.N^GVAR)  NAGVAR* IP(1 ) 


DO  15  1*1.  NAGVAR 
NB(I  )*C 
NA(I )  *0 
CONTINUE 

N*LPTR(  (NTREAU1)  )-l 
MEDI-1FI  XI  C.54FL0AT(  N#1 )  ) 
MED2*IFI X( 0. 54FL0ATI  N+2 ) ) 
CO  18  1*1, N 
KEY!  I  1*1 
SVALI  II*LIST(  I  ) 
CONTINUE 


194 


*.'•  .*>■ 

*  .  1  4  ' 

\-  v  *: 


C 

cm 


CALL  SHSCFT (S  VAL  »KEY  »M 

Q5Q*  0,  5* ( S VAL  (MED1)+  S VAL(  MED2)  ) 

MR  IT  E(NOf  *111  )  ME01,Mg02»  SVALfME  Dl)  »  SVAL  <  MED2  )  »Q50 
FORMAT!  1  X»  *MEC1,MED2  *  ,215,'SVAL!  1 ,2 ) ,MEDI AN ' , 2F10.3 I 
00  30  I  *  1  »NTREAT 
LL*LPTB(I) 

UL*LPTR  ( ( I  *1)  )  -1 
K*AGVARI(I  ) 

N  AV!  K  )  *NAV  (K) +UL-LL+1 


00  20 


I 


IF (V, LE, 

20 

CONTINUE 

30 

CONTINUE 

CO  35  1*1,  NAG  VAR 

A  *A+NA  i  1 ) 

b*B+NB(I) 

35 

CONTINUE 

NN*N*N 

50) 


NA(  K)= 
NB(K)‘ 


NA(K I +1 
NB(K)  +1 


* 


A8*A*B 

CONS T*FL GAT (NN  )/ FLOAT! AB  ) 

CO  50  1*1  yNA6 VAR 

IF  !NAV!I),  EQ.O)  GO  TO  50 
AN*A*NAV( I J 

NUM»!  FLCATCINAII)  I J-FLOAT  (AN  I /FLOAT  !N  )  1**2 
T*T«-CONST*NUM/FLOAT(NAV!  I  )) 

CONTINUE 
OF-NAGVAR-1 

WRIT  EjNOFf  101  1  Q50, NAGVAR 

FORMAT!  IX, 'THE  MEDIAN  VALUE  CF  THE  TEST  IS*,F10.3/1X 
'THE  CELL  COUNTS  CF  THE ',13. 

•  TREATMENTS  CR  AGGREGATES  ARE:  •  I 
WRITE  ( NCF 1 102)  ( I  ,1  -1 , NAGVAR) 

FORMAT!  iSX,X614i 

WRITEINO?, 104  1(NA!II  » 1*1 » NAG  VAR *  »A 
FORMAT!  IX,  •  >  MEDIAN  *.1614) 

WRIT  E (NO  F  ,103  HNB1I)  ,1*1, NAGVAR)  ,B 
WRIT E !NOF  ,  105  I  (NAV! 1 1 , 1 *1, NAGVAR 
FORMAT!  IX,*  *<  NEO  IAN'  ,  161  41 
FORMAT  i  IX  ,  *  TCTALS  »,16I4) 

WRITE  !  NCr  ,106 )  T,DF 

FORMAT!  IX/ IX, 'CHi-SQUAREQ  VALUE'  ,F10.3/ 

■IX,'  DEGREES  OF  FREEDOM' ,15) 

RETURN 
ENO 

treat* 

C  THE 

S8WI  _ 

C  ADDITIONAL  NUMBERS  COMPUTED  HAVE  SOME  MEANING 

REAL. TERM?, RfS$RiSSQiT,i 


50 

101 

102 

104 

103 

105 

106 


},N 


SUBROUTINE  KRLWAL! LI  ST, LPTR, IP, RP  ,NOF .NTREAT, 

THIS  SUMS  THE  RANKS  OF  EACH  TREATMENT  AND 
SQUARE  OF  ALL  RANKS  TO  FIND  THE  OVERALL  CFI  SQUARE 
ISTIC  FOR  THE  NULL  HYPOTHESIS  THAT  ALL  TREATMENTS 

In  equal  mean,  if  reject  that  null,  then  the 


IN 


PTR! 17  - 


,LIST(400)  ,RP(50) 

0CJ,FN(17).SVALS  (400  - 

!  17)  •  IP!  50)  ,  NGF  ,  TREA 
(400)  ,0F1,DF2 


gy 

Streat+u  *-l 


•  ispisui, RANK!  400), 

I ,RN I ( 15  ) 

REAT(15) , NT BEAT ,NJ(17) , 


10 


N*LP 
0F1-N-1 
CF2*N-NTREAT 

CALL  RANCUc(N  ,LI  ST, SVALS  .RANK, Q ,  KEY  ,  NTREAT,  0 ,  II 
t-0  10  I*  1, NTReAT 
RNI! 1 1*0,0 
CONTINUE 
SSR*0,0 
SRIDNI-0  ,0 
00  2  0  1*1, NTREAT 
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LL*LPTP(I ) 

UL»LPTR(l  HI)  )-l 
N  J( I ) «L  L+l-LL 
FN(I ) “FLOAT (N J ( I) ) 

00  30  «J*LL  »UL 
R*RANM  J ) 

RM  ( I  )  *R  N I  ( I  l+R 
SSR»SSR*R*R 
30  CONTINLS 

SRIONI*  SR  I  ONI  (RN  I  ( I )  *RN  I  (II  )  /FLOATIN  J (I  it 

20  5I:RmI^LCAT(N*(N  +  1)*  (N+l))/4.0 

SSQ*  ( SSR-T  ERM  2  1/  FLOAT  (N-l ) 

T“(SRIDNI-TERM2)/SSQ 
WRIT e  (NCF,101)  N,NTREAT,  SSR 
101  FORM  AT  ( 1  OX  ,'**KRU$KAL-WALLI  S  TES  T**  *  /  IX.  •  IN  SET  OF* 
*15,*  DATA  ELEMENTS,  OIVICED  INTO*, 15,*  SAMPL£S*/lX, 


%?\ 


♦  •THE  SUM  CF  SCUAREO  RANKS  WASHF10.3, 

♦  »  THE  SUMS  OF  RANKS/ SAMPLE:  •  / 

♦  /IX'  VAR  NR*  »  5X»  '  NR  DATA  ELEMS*  , 5X* SUM  OF  RANKS*  I 
CO  40  I* 1»NTREAt 

WRITE  ( NCr , 10  2 )  TREAT  (I  )  ,NJ  (  I  >,  RNI(  I ) 

FORMAT  (1X«I5,£X,I5,12X,F8.2) 

CONT  INUE 

WRITE(NOF  *103  I  T.CF1.CF2 

FGRMAK1X/1X,  •YIELD  A  CHI  SQUARE  VALUE  OF  :•  ,F  10. 3/iX  , 

♦  •  WI TH*  » 15  *  *  DEGREES  OF  FREEDOM.  IF  ANO  CNLY  IF'/ IX, 

♦  'NULL  HYPOTHESIS  REJECTED#  FOL  NUMBERS  CAN*/1X, 

♦  'TEST  WHETHER  SPECIFIC  PAIRS  OF  VARIABLES  HAVE'/iX, 

♦  'THE  SAME  MEAN.  IF  ABS  DIFFERENCE  IN  MEAN  RANKS*/1X, 

♦  *  iR(I)/N(I)-P(J)/N(  JJi  >  POOLED  STANOARC  DEVIATION*/ 

♦  IX,*  TIMES  THE  T  QUANTILE  l-ALPHA/2  THEN  REJECT  THE*/ 


•NULL  HYPOTHESIS  RE 
'TEST  WHETHER  SPEC! 
•THE  SAME  MEAN.  IF 
*  iR(I)/N(I)-P(J)/N( 


♦  IX,*  TIMES  THE  T  QUANTILE  l-ALPHA/2  THEN  REJECT  THE'/ 

♦  IX,*  HYPOTHESIS  THAT  THE  PAIR  OF  MEANS  ARE  EQUAL*/1X, 


♦  •  (DEGREES  OF  FREEOOM  HERE  :• .  I  4.*  >  • //3X  ,*  I  *, 

♦  •  J  ABS  CIF  MEAN  RANK  POOLED  STANDARD  DEVIATION*  t 
CONST»(SSC*(FLOAT<N)-1.0-T)/FLGAT(DF2))**a.5 
NTM1-NTREAT-1 

CO  60  I«1,NTM1 
IP1-I+1 

DO  70  J*IP1»NTPEAT 

ACMR»A0S(  (RNI(I)/FN(I))-(RNI(J)/FN(J)  )) 
SDPOOL=CONST ♦(  (1.0/FN(I  )  )♦( 1.0/FN(J ) )  » 

WRITE  (NQF, 104)  I ,  J.ADMR  ,  SDPOQL 
104  FORMAT  (2X  ,21  3  ,6X  ,F8.  3  ,14X  ,F10.3) 

70  CONTINUE 

60  CONTINUE 
RETURN 

C— /— / — /— / — / —/—/—/ — / — / — /— / — / — /— / — /— /— /—/—/—/— /— /— / — /— /— /— /■ 
SUBROUTINE  SQLARK  ( L  I  ST  , LPTR ,NTREAT  , TREAT  .1  P,  PP,  NOF) 

C  THIS  TESTS  WHETHER  A  DATA  SET  HAS  EQUAL  VARIANCE  AMONG 
C  ALL  VARIABLES  BY  (1)  FINDING  THE  ABSOLUTE  DEVIATIGNS  OF 
C  EACH  VARIABLES  ELEMENTS,  IN  SUBROUTINE  ADEV,  (2)  RANKING 
C  THOSE  DEVIATIONS  (RANCUE),  ANO  (3)  SUMM ING, SQUARING, ETC.  , 
C  THOSE  RANKS  AS  REQUIRED  TO  FIND  THE  TEST  STATISTIC 

REAL  LI  ST ( 400  ),DEV(400)  ,RP(  50)  ,FN,FM  ,FNT  ,R(  400  , 

♦  SUMR  J(1 7  j  ,FNJ  (17  )  ,R2MU,  RMU,T1,T2  ,  NUM ,  RSQ  ,  SSSR  2J , 

♦  SUMR2J?  17) 

INTEGER  LPTR (17) » NTR EAT,  TREAT!  17  )  ,IP(  SO)  , NOF  ,  KEY (400  ) 
NT*LPTR(  (NTREATh1))-1 
FNT*FLOAT  (NT ) 

CALL  AOE  V  (LI$7»LPTR»  NTREAT,  DEV) 

CO  3  1-1, NT 
K  EY( I  )  *  I 
3  CONTINUE 

CALL  RNQ (DEV, KEY  ,NT, R) 

CO  5  J*l, NTREAT 
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v» 


10 


101 

102 

* 

* 

40 


50 

103 

104 

* 

* 

* 

95 

C - 


SUMR  2  J  (  J)  *C 
SUMRJ  ( J  )*0 
CONTINUE 
SUMR  *0.0 
|UMR2*0.Q 
SUMR4*0.  C 
J*1 

CO  10  1*1. NT 
RSQ*R(  I)*R  ( i I 
SUMR* SUNR+FI I  I 
SLMR2-SLMR2+RSC 
SLMR4»SlMR<+RSC*RSQ 
INCRJ*LFTR  UJ  +  1)) 

I  F(I  •  G  E  •  INCRJ  I  J*J+1 

SUMRJ  (J)  *$UMRJ(  J  J  +R(  I) 

SLMR2 J(J )*SUMR2J(  J) +  RSQ 

CONTINUE 

R2MU*SUMR2/FNT 

RMU*SUMR/FNT 

IF  (  NTREAT. GT.2)  GO  TC  40 
FN*FLCAT7lFTR(2)-1I 
FM*FLCAT(L  FTR(3)-LPTR(2)  ) 

OTERM2«FN*FM/4FNT-l.  Oi 
0TERM1*CTEFM2/FNT 

DEN0M«(CTEFM1^SUMR4-0TERM2*R2MU*R2MU)  **0.5 
N UM-SUMR2J  ( 1J  -  FN*  R2MU 
T 1*NU M/CENCM 

WRITE  (5.101)  0TERMl.0TERM2.su  MR4,R2MU.SUMR2J(1) 
FORMAT (IX.  •DTERM1-2.SUMR4.R2MU.SUMJ2.  1S5FS.4) 
WRITE  (  NCF  »  102  I  T1 

FCRMATdOX  ,'****  SQUARED  RANK  TEST  FOR  •, 

•EQUAL  VARIANCE  ♦♦♦♦•/IX. • TEST  STATISTIC  •, 

•FOR  THIS  TWC  SAMPLE  DATA  SET  IS*,F10.4) 

GO  TO  95 
CONTINUE 

DSQ*<  SLMR4-FNT4R2 MU*R2MU)  /(FNT-1.0) 

SSJMU-C.O 

DC  50  J*1 .NTREAT 

SSSR2 J*SUMR2J( J  )  *SUMR2J  (  J) 

SSJMU*SS  JMU+SSSR2J/  FLOAT  (LPTRHJ  +  l)  )-LPTR(J)  ) 

ffSTi  SJMU- FNT  *R2MU*R2MU) /OSQ 
WRIT  E  (9 . 103 )  SSJMU.R2MU .SUMR4.0S  Q 

FORM  AT  ( IX.  •  s!jUM.R2MUt  SUMR4.0SQ  •  .2F10.4  .F20.4.F10.4  ) 
WRIT  E(NQF.104  )  NTREAT,  T2 
FORM  AT(  1  C>  ♦ 

*♦♦♦*  SQUAREC  RANK  TEST  FOR  EQUAL  VARIANCE  ♦♦♦♦•/ 


RETURN 
END 


IN  A  CATA  SET  OF  *,I3.'  VAR IABLES •/ IX, 
THE  1EST  STATISTIC:*  ,F10.4» 


SUBROUTINE  AO E V ( L  1ST ,LPTR. NTREAT .DEV ) 

REAL  LI  ST  1 400  I  »RP  (50  )  »DEV(400) » V  ARX  »  MU.X  (  200  ) 

“  TEGER  LPTR(  17)  ,K, NTREAT, IP(50)  ,NOF  ,  LL ,  I  PI ,  N I 
10  1*1. NTREAT 
IP1*I*1 
LL*LPTR( I ) 


NI*LPTF(IP1)-LL 
i*l.NI 


12 


30 


'LIST ( (L  L+K-l ) ) 


00  12  K' 

X  (  K  ) ! 

CONTINUE 
CALL  MLtARU.NI.MU.VARX) 

DO  30  Ml.M 
J*LL*I«-1 

0EV(J>*ABS(LISTl  J)-MU) 
CCNTINUE 
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L  I  *L  P  T  R  <  I ) 

WRITE (5.102)  I tLI , LI  ST (LI ),MU,CEV(LI) 

105  FORMAT!  IX,  'VAF#,  LPTR  iLl  ST ,MU»DEV  CF  LPTR  :  •  ,2 1 3  »3F10.  4 ) 
IQ  CONTINUE 
RETURN 
ENO 

C- /-/-/-/-/-// — /-/-/-/- 
SUBROUTINE  HAFTLE (LI  ST, LPTR. NTRE AT .TREAT , IP.RF.NCF) 

C  THIS  SUBROUTINE  FINOS  THE  STATISTIC  FOR  THE  HARTLEY'S 
C  TEST  FOR  EQUAL  VARIANCE,  AN  F  RATIO  BETmEEN  THE  SMALLEST 
C  ANO  URGEST  VARIANCE  IN  A  SET  OF  DATA. 

REAL  LIST (400  ),RP(50 ) ,X ( 200 ) ,VMI N, VMAX.V ARX , FRAT 10, M U 
INTEGER  LFTR( 17) , NTRE AT .TREAT (16 ) ,IP (50) ,NOF, LL.UL, 

*  IP1.NI 

VMIN-IOOCCOO.C 
VMAX*-10.C 
CO  10  I-l.NTREAT 
IP1-I+1 
LL-LPTP(I) 

NI*LPTP(IP1)-LL 
00  12  Ml »  M 

X(K)*LIST((LL+K-1)) 

12  CONTINUE 

CALL  MLvAR(X,NI  ,MU,VARX) 

IF  ( VARX.GE.VMIN)  GO  TO  20 
VMN-VARX 
IVMIN*TREAT( I) 

20  IF(VARX.LE.VMAX)  GO  TO  10 

VP AX*  VARX 
I  VPAX  3  TREAT!  I ) 

10  CONTINUE 

FRATIC*VPAX/VPIN 

WRIT  E(NQF,  101  )  NTREA  T  ,  I VMIN  ,  VMIN  ,IVMAX,VMAX  ,  FRAT  10 
101  FORMAT!  1  OX, 

*  ******  HARTLEY  TEST  FOR  EQUAL  VARIANCE****' /2X,  •  IN'  , 
*•  A  OATA  SET  WHICh  COMPARED  *,I3,'  VARIABLES.  ‘/IX, 

*•  VARIABLE*, 12,'  HAO  THE  SMALLEST  VARIANCE:  ',  F10. 4, 
**,'/lX,'  WHILE  VARIABLE*  ,13,  '  HAO  THE  LARGEST  :•  ,FIO.  4, 
*/lX,  •  WHICH  GIVES  AN  F  RATIO  0F:',F1C.4) 

RETURN 

ENO 

C /-/-/-/-/-/-/-/-/-/-  /- 
SUBROUTINE  CO  BREL  (LI  ST,  LPTR,  NTREAT,  TREAT  ,NOF,  IP  ,RPI 
C  THIS  SUBROUTINE  TAKES  LIST,  COMPOSED  OF  TWO  EQUAL  LENGTH 
C  VARIABLES.  PERFORMS  PEARSON*  S  RHO  ON  THEM,  THEN  RANKS 
C  EACH  VARIABLE,  PERFORMS  PEARSON'S  RHO  ON  EACH  TO  YIELD 
C  SPEARMAN'S  RHC »  ANO  IF  REQUESTED,  FIGURES  KENDALL'S  TAU 
C  IP  ( 1  1  IS  0PTICN:-1*SPEARMAN  CNL  Y,  0*  P  EAR  SONGS  PEARM  AN, 

C  1*PEARS0N. SPEARMAN  ANO  KENDALL 

REAL  LIST!  400  ),RP(50  »,SVALS  (  200  I  .RANK  (400  ),  B1  ,B0  ,TAU  , 

*  TRANK  S(  200  )  ,T  VALS  ( 200  )  .SRANQ  (  200  )  ,TRANC(20Q) 

INTEGER  LPTR!  17)  ,  KEY  (  AOO)  ,N TREAT  .TREATU  7 )  ,NOF, 

*  T  KEY ( 2CC ) , SKE Y (20  0 ) , OPT , I P( 50 ) 

NPAI R*LPTP (2) -1 

IF  I  IP( 1  I.LT.C)  GO  TO  5 
WR IT E (NO F  ,201  ) 

201  FORMAT!  1QX « '**C0RRELAT1  ON**'  ) 

CALL  RHO ER (LI  ST, NPAI R, RHO, Bl, BO, 0) 

WRITEINQF  ,101  I  RHC 

101  FORMAT! 'PEARSON" S  CORRELATION  ON  THE', 

*  '  ACTUAL  VALLES  OF  TWO  VARS :'/ 1  OX, F 10.3 ) 

5  00  10  I-l.NPAIR 

K* I*NP A  IR 
S  KEY (  I)*I 
TKEY (  I )  *1 

1  )*L  1ST  (  I ) 

I )  *L  1ST  (  K ) 


i  r\u  v  l  4 

SVALS  ( 
T  VAL  S  ( 
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CONTINUE 
CALL  RNQ 


(SVALS«SKEY  ,NPAIR,SRANQ ) 

CALL  RNQ  (TVALS,  TKEY  »NPAIR»  TRANQ  ) 

DO  15  I *1  ,NPA 1R 
K*I+NPA  IR 
RANK!  I  )*SRANQ  (  I ) 

RANK(K)«TRANQUI 

CONTINUE 

CALL  RHO£R(RAhK,NPAIR,RHQ,Bl,BO,  0) 

WRITE  (NCF  »102 I  RHC 

FORM  AT!  1  OX  » ***CORR£LATI  ON**'  /IX,  •  PEARSON  MS 
■  1 CORRELAT ION  ON  THE  RANKS  OF  TWO  VARS', 
‘/IX,*,  I.E.,SFEARMAN"S  RHC:', F10. 

IF  (  IPIll.LT.il  GO  TO  95 


NOIS  *0 
NC  ON=Q 


0  1  = 
S*SK 


IMPAIR 
EY  ( 1 1 


IP1*I  +  1 
ISPl-IS+1 
NCONI S*C 
N  CIS  I S«0 

DO  25  L«IP1.NPAIR 
J*SKE  Y( L) 

SC  IF*  SRANC  !J  l-SRANQI  IS) 

I F ( SO  IF.  LT.O  .1)  GO  TO  25 

7DIF*TRANQ! JI-TRANQ 1 1  SI 
1FITCIF  .LT.0.1 )  GO  TO  28 
NC  GN*NCCN+1 
NCONI  S-NCONIS+ 1 
GO  TO  25 

IFITCIF. GT.-0.1I  GO  TO  25 
NO  I S*  ND  IS +  1 
NO  I  SI  3*ND  ISIS  + 1 

CONTINUE 

W  RITE (9 , 14 1 1  ISiNCONISfNOISIS 
FQRMATI1X, 'INDEX  OF  X*',I5, 

•  ^CONCORDANT ,DI SCCNCORDANT* , 215) 


«SiS*  FLOAT  (NCON-NOIS  ) /FLOAT!  NPAIR*  (NPAIR-i  I) 
WRITEINOF,  142  )  NCCN,  NOIS, TAU 
FORM  AT  (  i  0  X  , '  *  ***  KENCALLS  TAU  ****'/lX, 

•  TOTAL  NUMBER  OF  CONCORDANT  PAI RS: *, 15/ IX, 

•  TOTAL  NUPBEF  OF  01  SCONCORDANT  PAIRS: I  5/ IX  , 

•  TAU**  ,  F1C.4  I 
RETURN 

ENO 

/-/-/-/-/-/-/-/-/-/-  /-/-/-/-/-/-/-/-/-/-/-/-/-/-/-/ 
SUBROUTINE  RECRETILI ST, LPTR, IP ,R P ,TR EAT, NCF ) 

<  USES  RHtER  70  FIND  THE  LEAST  SQUARES  FIT  UNLESS 
SPECIFIES  THE  HYPOTHESIZED  LINE  FOR  THE  REGRESSION 
EEN  AN  INCEP  (EXPLANATORY)  VARIABLE  ANO  A  DEPENDENT 
ABLE.  IT  TESTS  FIT  USING  THE  SPEARMANS  RFC  TC  FIND 
CORRELATION  EETWEEN  INDEPENDENT  VARIABLE  AND  THE 
DUALS.  IF  USER  RED' 0  IT,  IT  HILL  FINC  CONFIDENCE 
C$  ON  THE  SLOPE  (Bl)  OF  THE  FITTED  LINE. 

I :  INOEX  Cf  DEPENDENT  VARIABLE  IN  LIST  II  CR  2) 
l:  0*USE  LEAS  1  SQUARES  EST  Bl, 

1*HYP  Bl;  SUPPLIED  IN  RP(1I 
l:  0«NO  EST'S  OF  Y  BY  X,ELSE  *  #  OF  Y'S  EST,RP(3+) 
):  0*NO  CCNFICENCE  INTERVALS,  1*YES 
is  VALUE  CF  W  TO  COMPUTE  THE  SLOPE  INDICES 
INTEGER  LFTRI 171 , IP( 501 ,NOF, NPAI R, OPT  ION ,TREAT( 15), 
NPMl»KcY(2QC)»NR,NS»DIP!4)  ,N£S  TV,  I  X  ,L .  K  ,117  ,147 
REAL  LIST  (400  ),RP(50  )  ,B1 ,  BO  ,RHO,  S(  20000  >  ,  DENOM, ADEM 
X.Y.XSUM.YSUM  ,EY  ,UXl  400  I  ,  QALFD2  ,  SIR 
WRITEINOF  ,201  ) 


;’i 

;  VAR  i 
,  RANK 
;  RE  SI 
BOUN 


IPO 
IP  1 4 
RP  (2 


:NQM«  ADEM 


199 


FORMAT!  1C*  >'**NON-PARAMETRIC  REGRESS  I CN**  •  I 

EPSI  LN*D.CC01 

CPTI ON*  I  F  ( 1 1 

CIP<  1)*-1 

X8AR*0 

Y8AR  *0 

IX-3-CPT  ICN 

I Y*Q  PTI GN 

NP  AI R=LP  TF  t  2) -1 

CALL  RH0ER!LIST,NPAI R,RH0,bl ,80,  GPTICM 
WRITE  <NCf,10H  TREAT  <  I  Y  #  .TREAT  ! IX), 81,80 
FORMAT!  IX, 

•  WITH  DEPENDENT  VARIABLE  AS  VARIABLE  #',I5/1X, 

•  and  Explanatory  /independent  variable  as  *',is/ix, 

•  THE  LEAST  SCUARES  FIT  OF  LINEAR  REGRESSION  IS: '/IX 

•  DEP  VAR  =  •  ,F9.2,'  *  INDEP  VAR  +  ',<=10.21 
B1R*RP!1) 

IF  (  IPI2  l.LT.l)  B1R*81 

IXL*  LPTR !  I  X 1  - 1 

IYL*  LPT  R  { I  Y I  -  1 

WRIT  E  (9 , 67  3  1  IXL,IYL 

FORMAT!  IX,'  X  &  Y  INDICES  :' ,  215  ) 

CO  20  1*1, NPA  IR 
J*NPAIR  +  I 
X*LI  ST  !  I  XL  I ) 

UX(J)*X 

Y*LIST!IYL-*II-B1R*X 
UXII  )*Y 

WRIT  E  !9 , SB7 )  LX!I),UX!J) 

FORMAT!  I  X  , * X ,  RESIO:  •  ,2F10.4) 

CONT  INUE 

WRIT  EINGF  ,102  1  81R 

FORMAT!  IX, 'TO  TcST  CORRELATION  BETWEEN  THE  INCEPEN', 
•DENT  VARIABLE'/IX,*  AND  THE  RESIDUALS  OF  THE  FIT,'/ 
IX,'  WHERE  B1*',F1C.4  1 

CALL  CORRELIUX, LPTR, NTREAT, TREAT, NOF  ,DIP  ,RP) 
NESTY*IP!3  1 

IF (N cST Y • EC. 0  1  GO  TO  24 
CO  21  1*1, NPAIR  , 

XSUM»XSLM  +  LIST!  IXL+I  > 

Y  SUM*  YSLM* LIST ! IY  L*I I 
CONT  INUE 

IFUP12I  .  EQ.l  J  B0*!YSLM-B1*XSUM1  /FLOAT ( NPAIR  I 
WRIT  E(NOF,  121  I  NESTY  ,81 

FORM  AT!  IX. 'ESTIMATE  QF',13.'  DEPENDENT  VARIABLES  BY 
'REGRESS  IlN,*  /IX.'WI  TH  81»*  ,F  10.41 
CO  22  1*1  .NESTY 

EY*Bi*wP! ( 2*1  I l+BO 

WRIT  E  INCr  ,  122  J  RP((2+I»)tEY 

FORMAT  i  IX,  '  INCEP  VAR  VALUE:  •  ,  F10  .4, 

DEP  VAR  ESTIMATE:' ,F10.41 
CONTINUE 

IFCIPI4)  .6C.0  )  GO  TO  95 
I  S*0 

QALFD 2*1.0-!  ( 1 .0— RP!  2  I  i*0.5) 

N*NPAIR 
I  FIN. GT. 601 

W-QALFD2*!  (  FLO AT !N* IN- II*!  2*N+5)  l/lfi.OI  **0.5) 

I  FIN.  LE.60  1  W*HP!  21 
NPM1*NPAIR-1 
DO  30  W.UPMl 
LL*I^1 

CO  35  J*LL, NPAIR 

CENOM-L  1ST ( IXL*  I  l-L  1ST! IXL*J  I 
ADEM*ABS!DENGMJ 
IF!  ADEM.LT.EPSILN1  GO  TO  35 
IS*I S*1 


,F10. 41 
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pi  r  -r.  -r  ■>.  n 


rr.v-wr 


,*v«  ■wl'w  ..wy*.  .•  •-'  -  *  1  ■(.■'•T-.'  -  ■ 


[>! 


i 

3 

i 


-vM 

•.V  - 


35 

30 


876 


103 


)-(  LIST(  IYL+D-L  1ST C  IYL  +  J  H/OcNOM 
IS]  -IS 


IS) 


S(IS 
KEY( 

CONTINUE 
CONTINUE 

CALL  SRSOR  7(S  »KEY 
1 17-17 
I  47-4 7 

wRI  T  E  (  9 »  8  16)  S(1  171  ,S(I47), I  S 
FORMAT (IX t* SLOPES*  ^SLOPES* . 2F9.4,I5 ) 

NR-IF  IX(0.  5*(  IS-WI) 

N  S-I F IX (0. 5*(  IS+W  )  +1 • 96) 

M RITE (NCF,  1031  M , NR  * S ( NR)  *NS . S (NS  ) 

FORMAT  (IX,  *  BASED  QN  A  VALUE  OF  W«*  ,F1G.4/1X, 

♦  •INDICES  CF  CCNFICENCc  INTERVAL*  &  VALUES  OF  SLOPS:'  , 

♦  IX, 'LOWER  BCUNO  INDEX:', 13,*  WITH  VALU E : • , F10.4/1X 


95 


BOUND  INDEX:', 13,'  WITH  VALLE :  •  ,F  10  .4) 


THE 


vt 

15 

>. 

■5 

20 

c*; 

.  * 

101 

S 

.V 


t'ul _ 

UPPER 
RETURN 
END 

C- /-/-/—/-/-/-  /-/-/-/-  /-/-/-/-/-/-  /- 
SUBROUTINE  MONORE (LI  ST , LPTR, IP, RP, TR EAT, NOF ) 

THIS  SUBROUTINE  FcRFCRMS  MCNCTONE  REGRESSION  BASED  ON 
LEAST  SQUARED  FIT  OF  THE  RANKS  OF  2  VARIABLES.  IF  THE 
USER  DESIRES,  ESTIMATES  CF  SPECIFIC  VALUES  OF  DEPENDENT 
VARIABLE  Y  BASED  CN  VALUES  OF  X  CONTAINED  IN  SP  VECTOR. 
ALSO,  IF  THE  USER  DESIRES,  A  TRACE  OF  THE  REGRESSION  *ILL 
BE  LISTEO. 

IP  1 1  I :  #  OF  DEPENDENT  VARIABLE  (  1  OR  2) 

IP ( 2  ):  *  CF  SPECIFIED  X  VALUES  FOR  EXTIMATIQN  OF  Y  VALUES 
IP i  3  12  DO  YOU  *AN  1  A  TRACE?  1-YES  O-NO 

REAL  LI S T (400 ) , X ( 200  ) ,Y(200)  ,RX( 2C0) ,RY( 2 CO) , TERM2, 

♦  B2,  A2, FN  , RP(  50  ,  SXX  ,  SXY 
INTEGER  KfcYX(  200  )  ,KE  YY(  2C0)  ,  NPAI  R  ,IX  ,  IY  ,  I  F{  50  ) , 

♦  LPTR  (17)  ,TPEAT(17  ) 

NPAI R«LPTP(2) -1 
FN-FLOAT(NPAIR) 

IX-(  2-1 P( 1 1  I *NPAIR 
IY*(  IP(  1 )-l ) ♦NPA I R 
WRITE(9,21CI  IX,  IY 

FORM  AT  ( IX,'  INTO  NCNCRE  5  INDEX  OF  FIRST  X  ,  Y:  •  ,  21 5 ) 

CO  10  I-1.NPAIR 
X(  I1*L1ST(  1*1X1 
Y(II-LIST(  1*1  Y  I 
K  EYX ( I  )-I 
K EYY  (  I  )*I 
CONTINUE 

CALL  RNG(X,KEYX,NPAIR,RXI 
CALL  RNQ  (  Y  ,KE  W,  NPAI  R  ,RY  I 
SXY- 0.0 
SXX-0.0 

00  15  I-l.NPAIR 
X ( I) -L  1STI  J+IX) 

Y(IJ«LIST(  I+IYI 
CONTINUE 
CO  20  1*1 ,NPA  IR 

SXY«SXY+RX(II*RY( 

SXX-SXX+RX  ( 1 1  ♦  R X( 

CONTINUE 

TERM2-FN*(  (FN-H.O  )**  21/4.0 
e2»(  SXY-TERM2  )/(  SXX-TERM2) 

A2*(  1.0-02  l*<  FN+1 .0)  / 2.  0 

WRIT  E  (NOF,  101 )  TREAT  (IP(  II)  ,  TREA  T(  (  3-IPl  1)1  )  ,£2,A2 
F0RMAT(1CX, •♦♦MONOTONE  REGRESSION**'  /  IX,  *  USING  ', 

♦  •LEAST  SCLARES  REGRESSION  ON  THE  RANKS  OF  VARIABLES'/ 

♦  IX, 'WITH  VARIABLE  ♦',13,'  AS  DEPENDENT  VARIABLE  Y'/ 
♦IX,'  AND  VARIAELE  #',I3,  •  AS  INDEPENDENT  VARIABLE  X'/ 

♦  IX,'  R(Y)  *  '  ,F9.  3 , '  *  R  ( X)  ♦  *,F10.3) 

NEST -IP  (2  ) 


201 


210 


10 


i! 


.  V.  ■  \  - -V\ 


.V  t'.w.  , 


c— 


N*NP AIR 

IF  (NEST.GT.O) 

*  CALL  ESTY(A2»82»X,Y,K£YX»KEYY»RX,RY,N»NEST»RP  ,NGr  ) 
IF  (  IP(3I.EC.l) 

*  CALL  TFACE(A2,E2,X,Y,RX,RY,KEYX,KEYY,N,NGF) 

RETURN 

END 


C  THI 
C  FOR 


5 

211 

101 

15 

102 


SUBROUTINE  ES7Y 

(  A2  ,82  ,X,Y,KEYX,KEYY,RX,RY,  N,NEST,RP,NOF  ) 

S  FINOS  TFE  ESTIMATES  OF  THE  DEPENDENT  VAR  Y 
VALUES  OF  THE  INDEPENDENT  VAR  X 
REAL  A2 ,82,X(  200)  ,Y(  200)  ,RX(200)  ,RY(  200  )  ,RP  (  50 
INTEGER  KEYX(200)  »KE  YY( 200) » KEY (  200)  »NES  '  N ,X  START » 

Y  ST  ART 

IF  (  NEST. EC. 0)  GO  TO  95 
XSTART*0 
CO  5  1*1 # NEST 
KEY!  I  1*1 
CONT INUE 

CALL  SHSCRKRf, KEY, NEST) 

WRITE  (9.211)  RP  ( 1)  t  KEY  (  1)  »NEST 

FORMATdX,' INTO  ESTY:  RP  ( 1)  ,  KEY  {  1 )  .NEST:  ..10.3,215) 
DO  1C  1*1, NEST 
V  ALX*R  P (  I ) 

CALL  V  TCP  (X  «RX,KEYX»VALX,  RNKX.  N.XSTART  ) 

IF  ( RNKX.GE.  1)  GC  TC  15 

WRITEiNOF  ,101  )  I  ♦  VALX f X (  KEY X  ( 1 1  I  ,X(KEYX(N)) 


FORMATdX, *EST  #*»I3,*  NCT  POSSIBLE;  X**,F10.4/ 
9X»  *  FALLS  OUTSIDE  RANGE  OF*,F10.4,»  TO*  ,F10.4) 
GC  TO  10 
RNKY»A2+82*RNKX 

ystart*o 

CALL  PTCV  (Y»RY,KEYY  ,  VALY  , RNKY.N,  Y START) 

W  RIT  E (NOF , 102 )  I, VALX, VALY 
FORMATdX,  'EST  #•  ,13 


C 


CONTINUE 

RETURN 

END 


•:  X*'  ,  F10 .4,  *  *>  E  (Y I  X )  0F«,F1C.4) 


95 

101 

97 


SUBROUTINE  VTCR! Z ,RZ , KEYZ ,VAL2, R NKZ , N ,Z START ) 
HAS  A  VALUE,  FINOS  ITS  RANK 
REAL  Z(20C),RZ(20C),RNKZ,VALZ 
INTEGER  KEYZ!  200  »  ,N,  ZSTART 
RNKZ—1.  C 

IF  ( (  VALZ.LT.ZIKEYZd  )  ) )  .OR.  (  VALZ  .GT.  Z  ( KE  YZ(  N  )  ))) 
GO  TC  95 

CALL  INTEFFU  ,RZ ♦ KEY Z ,VALZ, RNKZ, N, ZSTART) 

GO  TC  97 

WRITE (9, 1C1J  VALZ  ,Z(  KEYZ  d)  )  ,Z(  KEYZ(  N  ) ) 

FORMAT!  IX,  ’CUT  OF  RANGE  :VAL  ,  LL,  UL:  ' ,  2FI0. 4) 
RETURN 
ENO 


SUBROUTINE  RTCV<  Z,RZ  , KEYZ, VALZ,  RNKZ,  N, ZSTART  ) 

C  HAS  A  RANK,  FINDS  ITS  VALUE 

PEAL  Z!2CC),R2!200),RNKZ,VALZ 
INTEGER  KEYZ!  200  )  ,N,  ZSTART 

IfcTRNKZ.GT.RKKEYZC  1))  ).AND.(RNKZ.LT.RZ(KEYZ!N)  ))) 


50 

95 


GO  TC  50 

IF!RNKZ.LE.RZ!KEYZ(1)  ))  VALZ*  Z  (KEY  Zd  )  ) 
!  FtRNKZ  .GE  *RZ  (  KEY  Z(N  )  )  )  VALZ  =  Z(KEYZ(N  )  ) 
GC  TO  95 

CALL  INTERP (R2,Z, KEYZ, RNKZ, V ALZ, N, ZSTART) 

RETURN 

END 
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SUBRCUT  INE  IN7ERP  ( A,  B  .KEY  ,AV  AL ,  B  V  Al , N  ♦  I  ST  ART  1 
THIS  SUBROUTINE  COES  LINEAR  INTERPOLATION:  KEY  1 5  T  nc 
ORDER  CF  VALS  A  AND  B.  AVAL  IS  KNOwN,  FIND  BVAL.  ISTART 
IS  THE  INDEX  CF  KEY  AT  WHICH  TO  START  LOCKING:  mI  TH  PRIOR 
KNOWLEGED,  SUCH  AS  IF  AVAL'S  WERE  SORTED,  ISTART  HAY  o£ 
GREATER  THAN  C 
REAL  AI20CI 


KCML.  MiiuiifB  (200  1,AVAL,BVAL»PROP,BPRCP 
INTEGER  I,  ISTART  ,N,KL, KU,  KEY  (200  1 
I*ISTART 
10  CONTINUE 
I  =1+1 
K  U*KE  Y ( I ) 

IF((  AVAL.  GT.A(KUJ).  AND.  (  I.LT.N1)  GC  TC  10 
KL*K  EY( ( I-ll  ) 

PROP =<AVAL-A( KL)  i / ( A ( KU  )-A( KL )) 

BPR0P*PRCF*<8  (KU  )-B(  KU  / 

BVAL*B(  KU+BPROP 
ISTART*  I-l 
RETURN 
END 

SLBROUTInI  TRACE (A2» B2*X,Y,RX»RY ,K6YX,KEYY,N,N0FJ 
C  THIS  SUBROUTINE  FINOS  THE  TRACE  OF  THE  REGRESICN 
C  IN  GENERAL  BY  TAKING  THE  ACTUAL  RANK  OF  Y,  FINDING 
C  THE  EXPECTED  RANK  OF  X  ANC  THE  CORRESPONDING  VALuE  OF  X, 
C  AND  THEN  PLOTTING  THAT  X  WITH  THE  ACTUAL  VALUE  OF  Y. 

REAL  EX  (  2CG1  ,  EY(  200)  ,X(200),  YI200 ),  RX  ( 200  ), R  Y  (200) , 
♦RY8, A2,B2,RY$ 

INTEGER  KEYX(  200  1  ,KE  YY(  200)  , N.NQ F 
K*Q 

WRIT  E(NOF ,101) 

101  FORM  AT  ( 1  OX ,  •  TRACE  OF  ThE  MONOTONE  REGRESSION  '/IX, 

*  •  X  *  »6> ,'  Y  »1 
RYS*  A2+  B 2* (RX  (KE YX(  1  )  ) } 

IF(RYX.LT.RY(KEYY(im  GO  TO  5 
K*K+1 

EX (K I *X (KE YX( 1  I  1 
YSTART»0 

CALL  RTC  V  (Y,RY,KEYY,YY,RYS,N,YSTART) 

EY(K)*YY 

WRITE(NCF,102I  EX(K),EY(K) 

5  CO  10  1*1, N 
J*KEYY (I) 

ERX*(RY(JI-A2J  /B2 
X  STAR  T*0 

CALL  PTC  V  (X,RX,KEYX,XX,ERX,N,XSTART) 

K*K+1 
EX(K)«XX 
E  Y(K I *Y ( Jl 

WRITE (NCF,  102)  EX(K),EY(K) 

10  CONTINUE 

RY8*A2+B2*(RX(KEYX(NJ  )) 

IF  <RYB.GT.RY(  KEYY(N)  ))  GO  TO  20 
K  *K+ 1 

EX(K»*X(KEYX(NII 
Y  START*0 

CALL  RTQV  (Y, RY ,K E YY , YY,R Y8, N , YSTART 1 
E  Y(K  I  *  YY 

WRITE(NCF,  102  )  EX(K),EY(KI 


*& 


n  M  I  cll'lri  1  I  CM 

FORMATi  IX  ,2F1  C*4  1 
CALL  PLOTT ( EX  ,EY  »  N , 1 1 


RETURN 

END 

SUBROUTINE  KCIMQG  (L  I  ST  ,LPTR  ,TR  E  AT,  I  P  ,RP  ,  NOF  1 
C  THIS  SUBRCUTI Nc  FINDS  THE  TEST  STATISTIC  FOR  THE 


203 


IFU01ST.Ew.2J 
IF  ( I  0 1ST  •  EC  .3  1 
1FUCIST.  EC.4  ) 
IF  ( I  CIST. EC. 5  1 


CALL 

CALL 

CALL 

CALL 


C  K0LMCG0RCV  STATISTIC  FOR  SAMPLES  OF  UNDER  31  ELEMENTS. 

C  DEPENDING  ON  NULL  HYP,  THE  TEST  STATISTIC  IS  THE  LARocST  , 
C  SMALLEST  OR  GREATEST  ABSOLUTE  DIFFERENCE  F(X1-S(X), 

C  WHERE  F  IS  THE  HYPOTHESISED  THEORETICAL  CIST  AND 
C  S  IS  THE  EMPIRICAL  CDF.  IP ( 1 1  *  CHOICE  OF  D1STR; 

C  IP  <  2 »  IS  THE  CHOICE  OF  HYPOTHESIS 

C  SEE  THE  DISTRIBUTION  SUBROUTINES  FOR  MEANINGS  CF  THE 
C  OTHER  PARAMETERS  RP  AND  IP. 

REAL  LI  ST (400  )  ,S(400  l,F(400l  ,RP(501 , FNtTSTAT 
INTEGER  LPTR (  171 , IP( 501 ,NGF , IDI ST ,HYP ,KE Y (400 l 
WRIT  E (NOF  ,101  ) 

101  FORMAT! 1  C>, *****  KOLMOGOROV  TEST  ****•) 

HYP*  I P ( 2  1 
N*LP  TR{  2  1-1 
FN*FLCAT(M 
IDIST*IP  (1  1 
DO  10  I  =1  ,N 

S  ( 1 1  *  FLCAT ( 1 1 /FN 
KEY!  I  1*1 
10  CONTINUE 

CALL  SHSORT(L  1ST .KEY ,N) 

IF  ( I  0 1ST  •  ED.  1 1  CALL  NCRMF  (L I  ST,  N  ,  RP  ,  F  ,N0  F  ) 

EXPF(LIST,N, RP,F ,NOF 1 
UNIF(LIST,N»RP»F,NGF 1 
ERLF(LIST,N,RP,IP,F, NCFl 

_  WEIBFILI  ST,N,RP,F  ,NOF) 

CALL  FINCT(S,F»N»TSUP,TMIN»T  MAX ,  NOF  ) 

IfThyp.ec.u  CALL  THYPA(TSUP, NOF  1 
IF  (HYP.  EG  .2  1  CALL  THYPB(TMAX, NOF  1 
IF  (HYP.  EC. 3)  CALL  THY  PC  (TMIN,NGF  I 
RETURN 

ENO _  _ _ 

"subroutine:  erlf7lTst ,n,rp,ip ,f,nofT 

C  THIS  FINCS  THE  THEORETICAL  ERALNG  DISTRIBUTION  F,  WHERE 
C  NN*IP(  11  =SHAPE  PARAMETER,  AND  L=RP(  1  1  *SCALE  PARAMETER 
REAL  LX,LXN,L,X,LIST(4001,RP(501  ,  F(  400 1  ,  DAl 
INTEGER  IP(50 1,1 ,N, FACT, NNM1 
NN*I P(ll 
NNM1 =NN- 1 
DO  30  1*1, N 
FACT*  1 
X*  LIST (1 1 
LX  =  L*  X 
LXN-0 

DO  23  J*l» NFM1 
FACT  =  FACT*J 
LXN*LXN-»(LX«*J  1/FACT 
23  CONTINUE 

IF  (X.LE.O)  F (  I  1  =0 

IF  (X.GT.01  F(  I  1*1  .0-(  1.0+LXN  1  *EXP(-LX) 

30  CONTINUE 

WRIT  E(NOF, 101 1  NN,L 

101  F0RMAT(1X,*WITH  HYPOTHESISED  DIS  TRI BUTUI  ON  ERLANG'/l  X 

♦  •WITH  PARAMETERS  N*»  ,15,*  AND  LAMBDA*  • ,  F 10.  4 1 
RETURN 
END_ 

"subroutine  wIIbfTlist.n, rpTf.nof  1 

C  THIS  FINDS  THE  THEORETICAL  WEI  BULL  DISTIBUTION  F,  W 
C  A*RP  <  1)  *SHAPE  PARAMETER,  AND  L*RP(  2  1  =SC  ALE  PARAMETER 
C  LAMBDA 

REAL  A,  L  ,X,LI$T(400) 

INTEGER  I 
A*RP (11 
L*RP  (21 
CO  10  I  *  1  »N 


WHERE 


>  RP  (501  ,  F(400  1 


204 


F(  I  )*0 

F4  I  >*1  .0-EXP(-((L*X»**AI  » 


X*LI ST ! I ) 

IF(X.LE.O) 

I F(X.GT.O) 

10  CONTINUE 

WRIT  E4NQF  » 101  )  A  » L 
101  FORMAT!  1X»'  WITH  HYPOTHESISED 
♦  •WITH  PARAMETERS  ALPHA 
RETURN 
ENO 


DISTRIBUTION  wEIEULL'/iX, 
•,F10.4,  •  ANO  LAMBQA* • ,F10.4) 


THIS 


SUBROUTINE  UN  IF! LIST ,N, RP.F, NOF I 
SUBROUTINE  FINOS  THE  THEORETICAL  UNIFORM 


LOWER  BOUND*  B*RP(  2  )*UPPER 
LIST (40  0  I » RP (50  I  »F(  400  ) 


DISTRIBUTI  ON 


BCUND 


F  (  I  )a 
F  <  1 1 3 


0 

1 


10 

101 


c 

c 

c 

c 

c 

c 


WHERE  A*RP  41 )! 

REAL  C,A,e,X, 

INTEGER  I 
A-RP  (11 
e*RP (2) 

C*B-  A 

CO  10  I  *  1 » N 
X*LIST!  II 
IF  (X.LE.AI 
IF  (X.GE.B)  .  __ 

IFKX.GT.A  ).  ANC.  !  X.  LT.  B)  I  F  ( I  )*(X-A)/C 
CONT  INUE 

WRIT E (NOF  ,1011  A,e 

FORM  AT!  IX, 'WITH  HYPOTHESISED  DI  STRIBUTUl  ON  UNIFORM*  » 

♦  /IX, 'WITH  PARAMETERS  LOWER  L  IMIT*‘ ,  F10.4, 

♦  •  ANO  UPPER  LIMIT**  ,F10.4I 
RETURN 

ENC 

C- /-/--/-/ /-/-/-/-/-/-/-/-/  — 
SUBRGUT I NE  LI  LIEF  (L 1ST , LPTR .TREAT , 1 P.RP , NOF ) 

THIS  SUBROUTINE  FINDS  THE  TEST  STATISTIC  FOR  THE 
LILLIEFORS  TEST  FCR  SAMPLES  OF  UNDER  31  ELEMENTS. 

DEPENDING  ON  HYPOTHtS  IS,  TEST  STATISTIC  IS  THE  LARGEST 
SMALLEST  OR  GREATEST  ABSOLUTE  DIFFERENCE  F!X)-S(X>, 

WHERE  F  IS  THE  HYPOTHESISED  THEORETICAL  DISTRIBUTION  ANO 
S  IS  THE  EMPIRICAL  CDF.  IP41)  IS  THE  CHGICE  CF 
DISTRIBUTIONS;  IP!2I  IS  CHOICE  OF  HYPOTHESIS.  PARAMETERS 
ARE  ESTIMATED:  RP(1I*SAMPLE  MEAN  »RP  !  2)*VARIANCE 
SEE  THE  DISTRIBUTION  SUBROUTINES  FOR  THE  MEANINGS  OF  THE 
OTHER  PARAMETERS  FP  AND  IP. 

REAL  LIST ( 400  ),S  4  400  ) ,F ( 400 ) ,RP 4  50 1 , FN, TSTAT , MU, 

♦  TMIN  ,TMAX  ,  VAR  ,TSUP 

INTEGER  LPTR! 17) , IP! 501 ,NOF, IDIST,HYP ,KE Y 1400 ) 

HYP*IP!2  ) 

N*LP  TP! 2 ) -1 
FN*F  LCAT (N ) 

CALL  MUVAFUI  ST,N,MU  ,VARI 
WRIT  E!NQ  F  ,  101  )  MU, VAR 

FORMAT!  1  OX,  *♦♦♦♦  LIL  LIEFCRS  TEST  ♦♦♦♦•/IX, 

♦  •WITH  CCMFLTEC  SAMPLE  MEAN*'  ,F10 .4, 

♦  • •  AND  VAR**  , FI 0 .4) 

PP  (1  )  *MU 
RP (2 ) *VAR 
IDIST*IP  11 ) 

IF1ID1ST.EC.2  )  RP!1)  *1.0  / MU 
CO  10  1*1,  N 

S  4 1 )  *FLCAT  (II/FN 
KEY!  I  )*I 
CONT  INUE 

CALL  SHSCRT1LIST  ,KEY  ,N| 

IF4IDIST.EQ.il  CALL  NCRMF  (L I  ST,  N  ,RP ,  F  ,NOF  ) 

IF  (I  C  1ST  .EG.2  I  CALL  EXP  F  4  LI S T,N  ,  RP ,  F  ,NOF  ) 

'ALL  FINC7(S»F,N»TSUP»TMIN,T  MAX  ,  NOF ) 

F  (HYP.  EC.  II  CALL  TH  YPA  (  TSUP  ,NOF  ) 

IF4HYP.EC.2)  CALL  THYPBtTMAX, NOF  ) 


101 


10 


•  • 

51 
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IF(HYP.  cC.31  CALL  THYPCITMIN, NOF  ) 
RETURN 


SUBROUTINE  WllKSH  (L  I  ST » LPTR  ,TRE  AT,i  r  *nr  , rsuri 
THIS  SUBROUTINE  FINOS  THE  TEST  STATISTIC  FlR  THE 
SHAP IRQ—  WILK  TEST  FOR  NORMALITY,  N  <  31  ELEMENTS. 

IP(1I>  NUMBER  CF  COMPARISONS*  TRUNCU  DATA  E  LEMS  /  2) 

RP  (1  ) ,..  •  »RP  (K  )*  HEIGHTS  TO  SCALE  COMPARISONS 

THE  DENOMINATOR  IS  THE  SUM  OF  SQUARED  OEVATICNS  CF  XiAND 

HERE  IT  IS  FOLNO  EY  SXX*  ( N-l )  *VAR(X1 

THE  NUMERATOR  IS  SQAURE  CF  SUMS  CF  COMPARISONS  OF  VALUES 
OF  EQUAL  DEPTH' FROM  END  POINTS,  E.G.  ,  WITH  N  CATA  ELMS, 
X(N)-X<1  ),X(N-1J-*(2I|  AID  IS  MULTIPLIED  TIMES  THE 
DIFFERENCE 

REAL  LIST (400  )  ,S (400 ) ,F(400)  ,RP( SO) , FN, T STAT , MU ,VAR, 

♦  NUM, DENOM,MU,VAR, TSUP.SUMWT 

INTEGER  LFTRU7I  ,IP(  50)  ,NOF,  ID  1ST  ,HYP,K,  KEY (4 CO) 

N*LP  TR(  2 )- 1 

CALL  MUVAMLIST.N.MU.VAR) 

denom«flcatTn-d*var 
CO  5  1*1, N 
K  EY!  I  I » I 
CONTINUE 

CALL  SHSGRT!L  1ST ,KEY , N) 

K*IP<1) 

SUMWT-0.0 
CO  10  1*1, K 

TEMP*RF(ll*(LIST(  N-I+1)-LIST(  I)) 

WRITE!  5*210  RP(1  I  ,LI  ST(N-I*-1  )  ,LIST(I  )  ,TEMf 
10  FCRMATUX,  •RP.LISTHI  .LISTLO.TEMP  *,4F10.4) 

SUMWT-SLMWThTEMP 
0  CONTINUE 

NUM*  SUM  WT*  SUM  NT 
T3-NUM/DENCM 

WRIT  E(NOF,  101  I  N  ,MU,  VAR,T3,NUM,  DENOM 
01  FORMAT!  1 C>  »•**♦♦  SHAPIRC-WlLK  TEST  FOR  NORMALITY  ♦♦♦♦ 

♦  /IX,  ‘WITH  A  SAMPLE  SIZE  OF*  ,  15/1  X ,  •  W I TH  COMPUTED  *, 

♦  •SAMPLE  M  E  AN*  * »  F 1 0 .4  » * ,  AND  VAR*  • ,  F10  .4/ IX, 

♦  •THE  TEST  STATISTIC** »F10. 4/ IX, 

♦  •FROM  NUMERATOR*  ,F10. 4, ',  AND  DENOMINATOR**  ,F10. 3) 
RETURN 

ND 


NOF) 


IP!  1  )*  NUMBER  CF  CQMPAR 
RPil  ),.. • »RP ( K )*  HEIGHT 
THE  DENOMINATOR  IS  THE 
HERE  IT  IS  FOUND  EY  SXX 
THE  NUMERATOR  IS  SQAURE 


I, MU  ,  VAR  ) 
‘VAR 


15,21c )  RP!  1  I,LIST(N-I*-1  )  .LIST!  I  )  ,TEMf 
T  { IX,  •RP,LISTHI,LISTLO,T  EMP  *,4F10.4) 
-SLMWT+TEMP 


SUBROUTINE  SMIRN2  (L  1ST , LPTR .TREAT, IP.RP , NOF ) 

C  THE  SUBROUTINE  TAKES  ELEMENTS  OF  TWO  VARIABLES , SORTS 
C  THEM.  ANC  CQMFAIRS  THEIR  EMPIRICAL  CCF'S  (FOUND  BY 
C  DIVIDING  THE  RANK  MINUS  BY  THE  TOTAL  NUMBER  CF  ELEMENTS) 

C  I , M  REFER  TO  FIRST  VAR  X;  J,N  REFER  TO  SECONO  VAR  Y 
C  K  IS  COUNTER  FCR  DIFFERENCE  VECTOR  S .  L»  1  MEANS  LAST 
C  PASS  IP11)  IS  THE  INDEX  OF  HYPOTHESIS  TYPE:  1*H0:  NO  DIF 
C  2*HQ:F!X  )<*G(Y),  3*H0  :F(  X  )*>G!  Y  ) 

REAL  E  PS ILN , FN, FM ,X (100 ) . Y ( 100 ) , DIF , AOI F  ,S ( 200 ) ,TMI N 

♦  TMAX  ,  AT  MAX  .ATM  IN  ,TSUP,PDIr,LIST(400)  ,RP(  50)  ,  TSTAT 
INTEGER  I,J,M,L,M,N,KEYI(100),KEYJ(  100)  ,  JJ,IF(50) 

♦  LPTR  ( 17 )  ,NOF  ,HY  P, II,  TREAT  (17) 

CALL  FINCSUIST.LPTR  ,$,K) 

TMIN-2.0 
TMAX*-2. C 
HYP* IP  C 1 > 

CO  35  1*1, K 

If  ( S(I).gt.tmax)  tmax*S(I) 

IF  !S!  D.LT.TMIN)  TMIN*S!I) 

35  CONTINUE 

ATMI  N*ABS(TMIN) 

ATMAX*ABS  !TMA> ) 

TSUP*ATM  IN 


c 


w 

8  ? 


IF  ( ATMAX.GT. 7SUP  )  TSUP-ATMAX 

WRITE  (NCF,102t  TREAT ( 1)  •  TRE AT( 2  I  »TMIN,TMAX*T$UP 
FORMAT(10X,  ******  KQLMOLGQROV  TEST  ***»**'/lX, 

*•  WITH  DATA  SET  VAR  8', 13.'  AS  VARIABLE  X  '/6X, 

*•  ANC  VAR  Y '  »  13,  '  AS  VARIABLE  Y' /IX, 

*'  THE  MAX  CIFFERENCE  FIXKGI  Y):  1  ,  F10.4/1X, 

*'  THE  MAX  CIFFERENCE  F(  X)  >G(  Yl  :  •  ,FiO.  4/1 X  . 

*'  THE  OVERALL  GREATEST  0  IFF  ERENC  E  : '  ,  F 10.  A  I 
IF(HYP.EC.l)  TSTAT-TSUP 
IF(HYP.E0.2)  TSTAT-TMAX 
IF(HYP.EC.3)  TST  AT-T  M  IN 
WRIT  E(NGF  ,103  )  TSTAT 

FORM  AT(  1  X/1X  »  'TEST  S  TATI  STI C  :  •  ,  F  10,  4  ) 

RETURN 

ENO_  ^ 

'""subrout  TnI”cr  vonm  (  li  st,  lptrTtr!  atT7pTrp7ncf7""*~” 

THIS  SUBFCUT INE  FINDS  THE  TEST  STATISTIC  T2  CF  THE 
RA PER- VON  MISES  TEST,  AND  DOES  LINEAR  INTERPOLATION 
0  ARRIVE  AT  THE  ALPHA  HAT,  ALPHAT. 

REAL  EPSILN,FN,FM,S  (400)  ,SSUM,LSIG,ALPHAT,ALFHYP, 

*  T2  .ATMAX, AT  MN.TSUP, PDIF,  LIST  (400)  ,RP(  50)  ,CCNST 
INTEGER  J  ,K,L  ,M,N,KE  YldOOt  .KEYJdOQ)  ,1 1  ,  JJ ,  I  F( 50 ) , 


inicucK  jiiv,L,n,n,Picuuuuf 

♦  LPTRI17)  ,NQF,I,  TREATU7) 

CALL  FINCS(LIST,LPTR  ,S,K) 
M«LPTR(2)-1 


N»LPTR(3)-LPTF(2) 

CONS  T-FL  CAT  (M*N)  /FLOAT (  (  M+Nt  *(M*N)  I 
ALPHYP»l.C-RP(lt 
S SUM *0.0 
00  35  I  —  1  ,K 

SSUM«SSUMdS(  I  )*S(  I)  ) 

CONTINUE 

T2-CGNST*SSUM 

WRIT E(9, 121 )  P,N , K,C QNST, SSUM.T2 

FORM  AT (1  X  ,  'M  ,  N  ,K  *  CONST,  SaUM,  T2:  •  ,3I4,3F10.4I 

CALL  CRVALF(T2, ALPHAT) 

WRITE  (  NCF  .10  *  I  T2,ALPHAT,ALPHYP 

FORMATd  OX,  ******  CRAMER-VQN  MISES  TEST  ******'/lX, 
*'  THE  TEST  STATISTIC  T2  :  •  .F10.4/ IX, 

*•  GIVES  TFE  ALPHA  HAT  OF: * .  F10.4 /IX, 

*'  COMPARED  TO  THE  HYPOTHESISED  ALPHA ,F10.4/1X, 

♦  •  NULL  HYF  HO:  F(X)«G(X)  FOR  ALL  X',/1X, 

*'  ALT  HYF  HU  F<XK>G(X)  FOR  AT  LEAST  ONE  X*) 

IF  (ALPHVP.LE.ALPHAT)  WRITE  (NQF,111) 


FORMAT(  10 X,  ******  CRAMER-VON  MISES  TEST  ******•/; 
'  THE  TEST  STATISTIC  T2  :  •  .F10.4/ IX, 

•  GIVES  TFE  ALPHA  HAT  OF: * .  F10.4 /IX, 

'  COMPARED  TO  THE  HYPOTHESISED  ALPHA ,F10.4/1X, 

•  NULL  HYF  HO:  F(X)«G(X)  FOR  ALL  X',/1X, 

•  ALT  HYF  Hi:  F<XK>G(X)  FOR  AT  LEAST  ONE  X*) 

IF  (ALPHVP.LE.ALPHAT)  WRITE  (N0F,111) 


ALT  HYF  HI  :  F 
5  (  ALPHVP.LE.ALI 


(NQF 


LEAS  i 

,111) 


( ALPHYP.GT  .ALPHAT )  WRITE  (N0F,112) 


F0RMAT(1. 

FORMATd. 

RETURN 

ENO 


OC  NOT  REJECT  THE  NULL  HYPOTHESIS') 
REJECT  THE  NULL  HYPOTHESIS') 


SUBROUTINE  CR VALF <T2  , ALPHAT  ) 

REAL  W(  14),ALF(14),ALPHAT,PART,T2«0E,NU«LSIG 
w  ( 1 ) -0.0 
V(  2)  *0.  04 
W(  3)  -0.062 
W (4)  -0.079 
W(  5)  *0.  OS 7 

w6t«o.u; 

Vl 7)  -0.147 
W(  8)  -0.  184 
W(9) -0.241 
Wd0l-0.347 
Wdl  J-0.461 
W  ( 12  )  *0 . 7  4  2 
W  (13  t  -1  •  1 6  E 
W(14)«99 


•V. 


ALF  (  1J«0.C 
ALF(2)*0.l 
ALF(  3)*0.2 
ALF(  41*0.3 
ALF<5)*0.4 
ALF(  6)*0. 5 
ALF(  7  )  =  0 .6 
ALF(  8) *0. 7 
AUF(  9)*0.6 
ALF1 10) *0 .9 
ALF(  11)  *  C.  95 
ALF{  12)  *  C .99 
ALF(  13) *C .9995 
ALF(  14) *  1. C 
1*0 
1*1+1 

IF  ( (T2. GT. W(  1 ) ) .AND .  ( I  .LE.  13) )  60  TO  10 
IM  1*1-1 
NU*T2-W(  It'll 
CE*W  ( Il-W  (  IM1  ) 

FART-NU/CE 

LSIG*ALF  ( I  Ml)  +PART*(  ALF  (1)-ALF(  I  PI)  I 
ALPHAT-1  .C-LS  16 

WRITE  (9,131)  I ,  PART  ,T2  ,  ALF(  IM1 )  ,  ALF  ( I ) ,  ALP  HAT , 

*  W(IM1)  •  W  ( 1 1  fMJtOE 

FORMAT!  IX,1 1»f  ART,  T2  ,  ALF- ALF  ,  ALP  HAT*  ,  13,  5F10.4/1X 
*•  W(  IM1 )  ,  W  ( I )  ,NU  ,  OE*  ,4F  10.4 ) 

RETURN 

ENO 

SUBROLTINE  UTEST 


PURPOSE 

TEST  WHETHER  TWO  INOEPENOENT  GROUPS  ARE  FROM  SAM E 
POPULATION  BY  MEANS  OF  MANN-WHITNEY  U-TEST 

USAGE 

CALL  UTEST (A,R ,N1,N2,U,Z, IER) 

DESCR  I.PTICN  OF  PARAMETERS 

A  -  INPUT  VECTOR  OF  CASES  CONSISTING  OF  TWO  INDEPENDENT 
GROUPS.  SMALLER  GROUP  PRECEDES  LARGER  GRCUP.  LENGTH 
IS  N1+N2. 

R  -  OUTPUT  VECTOR  OF  RANKS.  SMALLEST  VALUE  IS  RANKED  1. 

LARGEST  la  RANKEO  N.  TIES  ARE  ASSIGNED  AVERAGE  OF  TIE 
RANKS.  LENGTH  IS  N1  +N2  . 

Ml  -  NUMBER  CF  CASES  IN  SMALLER  GROUP. 

V2  -  NUMBER  OF  CASES  IN  LARGER  GROUP. 

L  -  STATISTIC  USED  TO  TEST  HOMOGENEITY  OF  THE  TWO 
GROUPS  (OUTPUT). 

I  -  MEASURE  CF  SIGNIFICANCE  OF  U  IN  TERMS  OF  NORMAL 
DISTRIBUTION  (OUTPUT). 


IER-  0, 


0,  IF  NC  ERROR. 

1,  IF  ALL  VALUES  OF  ONE  GROUP  ARE  TIED. 

REMARKS 

l  IS  SET  TC  ZERO  IF  N2  IS  LESS  THAN  20 

SUBROUTINES  ANO  FUNCTION  SUBPROGRAMS  REQUIRED 
RJNK 


■THOO 


DESCRIBED  IN  S.  SIEGEL,  'NONPARAMETRIC  STATISTICS  FOR  THE 
BEHAVIORAL  SCIENCES',  MCGRAW-HILL,  NEW  YORK,  1956 
CHAPTER  6 


oo  non  ooo  non  non  non  noon 


'M.v.v.v.v.v.v’.v  ”  11  "  TvT  ■T  r.  ■{■.  ;■ 
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SUBROUTINE  UTEST  (  A  »R  »Nl  »N2»U  *Z*  I  ERi 
DIMENSION  All  )*R(  II 

RANK  SCORES  FROM  EOTH  GROUP  TOGETHER  IN  ASCENDING  ORDER*  A 
ASSIGN  TIEC  OBSERVATIONS  AVERAGE  OF  TIED  RANKS 

N-N1+N2 

CALL  RANK(A,R,N) 

2*0.  0 

SUM  RANKS  IN  LARGER  GROUP 


10  R2 


10 


20 


R2-0.0 
NP-N1+1 
CO  10  I-NP  »N 

~~«R2<  “  “ 


J+R  (  I  > 
CALCULATE 


20 

30 

40 

50 

51 

52 
60 


70 

80 


FNX-N1*N2 

FN-N 

FN2»N2 

UP=FNX*FN2*<  (  FN2  +  1.0  I/2.0I-R2 
U*FN  X-UP 

IF(UP-U)  2C,3C,30 
U-UP 

TEST  FCR  N t  LESS  THAN  20 
IFCN2-20I  80*40*40 

COMPUTE  STANDARD  DEVIATION 
KT»l 

CALL  TI E ( P  *N »  NT* TS) 

iF<Tfs-??IS*FA*FN-FNI/12l52,51*52 

IER-1 

60  TO  80 

S»SQRT(  (  FNX/C  FN*  (FN-1.01)  )*(  < <FN*FN*FN-FN  J/12.0J-TSI  I 
GO  TO  70 

S-SQRTI  FNX*( F  N*1  .0  )/  12.  0  ) 

COMPUTE  2 

Z*<U-FNX*C.5| /S 

RETURN 

FND 

SUBROUTINE  RANK! A *R*  N I 
DIMENSION  A(  1 )  *R  (  1 ) 

INITIALIZATION 
00  10  I  *  1  »N 
R ( I I *0# 0 

F I NO  RANK  CF  OATA 
00  100  I  *  1  »N 
TEST  WHETHER 
IFIRTim  2C» 

DATA  PCINT 
SMALL-0.0 
EQUAL-0.  C 
X-A<  II 
00  50  J-l  ,N 
IFiA(JI-X)  30*  40,  5 C 

COUNT  NUMBER  OF  DATA  POINTS  WHICH  ARE  SMALLER 


„  DATA 
20*  100 
TO  BE  RANKED 


POINT  IS  ALREADY  RANKED 


209 


30  SMALL*SMALL*1 .0 
GO  TC  50 

C CUNT  NUMBER  OF  DATA  POINTS  WHICH  ARE  EQUAL 
40  EQUAL«EQUAl+l  ,0 
R(  Jl  —1.  C 
50  CONTINUE 

TEST  FCR  TIE 
IF(EQUAL-l.O)  60,  60,  70 

STORE  RANK  OF  CAT  A  PCINT  WHERE  NO  TIE 

60  RU) *SMALl«l.C 
GO  TC  IOC 

CALCULATE  RANK  OF  TI  ED  OATA  POINTS 
70  P*SMALL  ♦  (EQLAL  ♦  1.01*0.5 
CO  9 C  J*  I  »N 

IF  (R  ( J)  + 1 .0 )  50,  80,  90 
80  R(  J)  »P 
90  CONTINUE 
100  CONTINUE 
RETURN 
END 

SUBROuf  INE*TiE<R*N*KT*T ) . 

CIMENSI ON  R ( 1 J 
INITIALIZATION 
T»0.  0 
Y*0.  0 

5  X*l©  0E38 
INOO 

FIND  NEXT  LARGEST  RANK 


10  IF(RUI-X) 
20  X-R< I» 

INO  IND  +  1 
30  CONTINUE 


30,30,10 

20,30,30 


IF (INO) 
40  Y*X 


ALL  RANKS  HAVE  BEEN  TBSTEO ,  RETURN 
I  9C, 50,40 


CT«Q .0 

C  CUNT  TIES 

?9(R(II-x|N60,50,60 

60  JoJSInU^0 

CALCULATE  CORRECTION  FACTOR 
IF  <C  TI  70.5.7C 
70  !f{kwi  7 5 , 8 C ,75 
75  T«T±CT*(CT-I.  1/2.0 


GO  TC  5 
80  T*T+(CT* 
GO  TC  5 


CT*CT-CT 1/12.0 
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